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This application is a continuation of International Application No. 
PCT/US02/08288, which designated the United States and was filed 14 March 2002, 
published in English, which claims the benefit of U.S. Provisional Application No. 
60/314,046, filed on August 22, 2001, and U.S. Provisional Application No. 
60/322,993, filed on September 18, 2001. 

The entire teachings of the above applications are incorporated herein by 
reference. 

GOVERNMENT SUPPORT 

The invention was supported in part, by the National Institutes of Health grant 
numbers AI45022 and All 9807. The Government has certain rights in the invention. 

BACKGROUND OF THE INVENTION 

Apoptosis is a morphologically stereotyped form of programmed cell death 
utilized by metazoan organisms during normal development as well as for homeostasis 
(Kerr et al, 1972; Horvitz et al, 1982; Steller, 1995; Vaux and Korsmeyer, 1999). In T 
lymphoid development, oc(3 T cell receptors (TCR) are generated by a stochastic process 
of rearrangement of variable gene segments. Through combinatorial diversity, billions 
of different TCRs are generated ensuring protection against a myriad of infectious 
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agents without a requirement to encode all TCR specificities in the germline (Davis and 
Bjorkman, 1988). One potential hazard of this diversity-generating mechanism is the 
creation of self-reactive TCRs. Those TCRs interacting too strongly with self-proteins 
bound to MHC molecules are deleted (negative selection) and those weakly reactive are 
5 preserved (positive selection). However, the processes influencing the selection of 
immune reactive T cells for development are poorly understood. 

SUMMARY OF THE INVENTION 

Work described herein details the identification of a novel gene regulated during 
the negative selection of immature CD4 + CD8 + double positive (DP) thymocytes. A 

10 systematic method of analyzing differentially expressed genes using TCR transgenic 
mice has resulted in the discovery of a negative selection gene, termed IkBNS, in the 
mouse and its homolog in the human. The IkBNS protein contains seven ankyrin 
repeats and it is homologous to known IkB family members but lacks ubiquitination- 
based degradation signals. In class I and class II MHC-restricted TCR transgenic mice, 

1 5 transcription of IkBNS is stimulated by peptides that trigger negative selection but not 
by those inducing positive selection (i.e., survival) or by non-selecting peptides. IkBNS 
blocks transcription from NF-kB reporters, alters NF-kB electrophoretic mobility shifts 
and interacts with NF-kB proteins in thymic nuclear lysates following TCR stimulation. 
The present invention relates to an isolated nucleic acid molecule consisting of a 

20 nucleic acid sequence selected from the group consisting of SEQ ID NOS: 1 and 3, the 
complement of SEQ ID NOS: 1 and 3 and nucleic acid sequences that encode SEQ ID 
NOS: 2 and 4. The invention also relates to an isolated nucleic acid molecule consisting 
of exons I to XIII of SEQ ID NO: 1 in a contiguous sequence, or combinations or 
permutations of exons I to XIII of SEQ ID NO: 1 in a contiguous sequence. The 

25 invention further relates to an isolated portion of SEQ ID NOS: 1 and 3, an isolated 
portion of the complement of SEQ ID NOS: 1 and 3, and an isolated portion of nucleic 
acid sequences that encode SEQ ID NOS: 2 and 4, wherein such portion is of sufficient 



1062.2002-004 



-3- 

length to distinctly characterize the sequence. For example, the isolated portion can be 
from about 10 to 25 nucleotides in length, from about 25 to 40 nucleotides in length, 
from about 40 to 200 nucleotides in length or from about 200 to 500 nucleotides in 
length. In a preferred embodiment, the portion is greater than 500 nucleotides in length. 
5 The nucleic acid sequences of the invention can be DNA or RNA, single 

stranded or double stranded, sense or anti-sense, coding or non-coding sequence. 

The present invention also relates to an isolated nucleic acid molecule 
comprising a nucleic acid sequence selected from the group consisting of SEQ ID NO: 

3, the complement of SEQ ID NO: 3, a nucleic acid sequence that encodes SEQ ID 
10 NOS: 2 and 4, and to an isolated portion of SEQ ID NO: 3, an isolated portion of the 

complement of SEQ ID NO: 3, and an isolated portion of a nucleic acid sequences that 
encode SEQ ID NOS: 2 and 4, wherein such portion is of sufficient length to distinctly 
characterize the sequence. The present invention further relates to an isolated nucleic 
acid molecule comprising a nucleic acid sequence that encodes exons I to XIII of SEQ 

1 5 ID NO: 1 in a contiguous sequence, or combinations or permutations of exons I to XIII 
of SEQ ID NO: 1 in a contiguous sequence. 

The invention further relates to an isolated nucleic acid molecule comprising a 
nucleic acid sequence that hybridizes under high stringency conditions to a nucleic acid 
sequence selected from the group consisting of SEQ ID NOS: 1 and 3, the complement 

20 of SEQ ID NOS: 1 and 3, and nucleic acid sequences that encode SEQ ID NOS: 2 and 

4. The invention further relates to an isolated nucleic acid molecule hybridizing under 
high stringency conditions to a nucleotide sequence selected from the group consisting 
of an isolated portion of SEQ ID NOS: 1 and 3, an isolated portion of the complement 
of SEQ ID NOS: 1 and 3, and an isolated portion of nucleic acid sequences that encode 

25 SEQ ID NOS: 2 and 4, wherein such portion is of sufficient length to distinctly 
characterize the sequence. In a further embodiment, a nucleic acid molecule of the 
invention comprises a nucleotide sequence which is greater than about 75 percent, more 
preferably greater than about 80 percent, and even more preferably greater than about 90 
percent, identical to a nucleotide sequence selected from the group consisting of SEQ ID 
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NOS: 1 and 3, the complement of SEQ ID NOS: 1 and 3, nucleic acid sequences that 
encode SEQ ID NOS: 2 and 4, an isolated portion of SEQ ID NOS: 1 and 3, an isolated 
portion of the complement of SEQ ID NOS: 1 and 3, and an isolated portion of nucleic 
acid sequences that encode SEQ ID NOS: 2 and 4, wherein such portion is of sufficient 
5 length to distinctly characterize the sequence. 

The invention also relates to a probe comprising a nucleic acid sequence that 
hybridizes under high stringency conditions to a nucleic acid sequence selected from the 
group consisting of SEQ ID NOS: 1 and 3, the complement of SEQ ID NOS: 1 and 3, 
nucleic acid sequences that encode SEQ ID NO: 2 and 4, a portion of SEQ ID NO: 1 

10 and 3 5 and a portion of the complement of SEQ ID NOS: 1 and 3, wherein such portion 
is of sufficient length to distinctly characterize the sequence. In preferred embodiments, 
the fragment is useful as a probe, or primer and is at least 15, more preferably at least 
18, and even more preferably 20-25, 30, 50, 100, 200 or more nucleotides in length. 
The invention also relates to DNA constructs comprising a nucleic acid 

15 molecule as described herein, operatively linked to a regulatory sequence, as well as to 
recombinant host cells, such as bacterial cells, fungal cells, plants cells, insect cells, 
avian cells, amphibian cells and mammalian cells, comprising the nucleic acid 
molecules described herein, preferably operatively linked to a regulatory sequence. The 
invention further relates to methods for preparing proteins and polypeptides encoded by 

20 the isolated nucleic acid molecules described herein, comprising culturing the 
recombinant host cell, which comprises an isolated nucleic acid molecule to the 
invention operatively linked to a regulatory sequence. 

The invention further relates to isolated proteins and polypeptides, or functional 
portion thereof, encoded by the nucleic acid molecules of the invention. In particular, 

25 the invention related to a protein or polypeptide having an amino acid sequence selected 
from the group consisting of SEQ ID NOS: 2 and 4. In addition the invention relates to 
proteins or polypeptides of the invention fused to another component, including but not 
limited to a GST protein, FLAG-tags, hemophilus influenza hemaglutinin tag or tags to 
direct cellular localization. 
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The invention also pertains to antibodies, including monoclonal and polyclonal 
antibodies, or antigen-binding fragments thereof, that selectively bind to the proteins 
and polypeptides of the invention, or portions thereof. Furthermore, the invention 
relates to a method for assaying the presence or absence of the protein or polypeptide, or 
5 portion of the protein or polypeptide, of the invention (e.g. encoded by a nucleic acid of 
the invention) in a sample, for example in a tissue sample, comprising contacting said 
sample with an agent (e.g. an antibody) which specifically binds to the encoded protein 
or polypeptide and detecting the formation of a complex between the protein or 
polypeptide and the agent. 

10 Furthermore, the invention relates to a method to identify protein-interaction 

partners that interact with protein or polypeptide of the invention, or functional portion 
thereof, comprising the steps of contacting the protein or polypeptide with the agent to 
be tested and assaying for presence or absence of complex formation between the 
polypeptide and agent. 

15 The invention further relates to a method of screening for an agent that is an 

agonist, mimic or antagonist or the proteins or polypeptides of the invention comprising 
the steps of contacting the protein or polypeptide, or functional fragment thereof, with 
the agent to be tested, and determining the level of activity of the polypeptide in the 
presence of the agent. Comparison of the activity of the protein or polypeptide in the 

20 presence of the agent with the level of activity of the protein or polypeptide in the 
absence of the agent determines if the agent is capable of altering the activity of the 
polypeptide. In a preferred embodiment, the activity of the protein or polypeptide is the 
modulation of NFicB-induced gene expression. 

Non-human transgenic animals are also included in this invention. Such animals 

25 are transgenic for a nucleic acid sequence selected for the nucleic acid sequences of the 
invention, or nucleic acid sequences encoding a polypeptide of the invention. 
Furthermore, the non-human transgenic animals can be used to identify agents that 
alters the activity of a polypeptide molecule of the invention, or functional fragments 
thereof, comprising exposing a transgenic animal to the agent to be tested and 
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determining the level of activity of the polypeptide or functional fragment thereof, 
wherein increased activity indicates that the agent is an agonist. In contrast, when 
decreased activity of the polypeptide, or functional fragment thereof, is detected, the 
agent is an antagonist. Preferably, the activity of the protein or polypeptide is 
5 modulation of NFicB-induced gene expression or binding to NFkB. In another 
embodiment of the invention, non-human transgenic animals that have all, or a 
fragment, of a nucleic acid molecule of the invention deleted are also envisioned. 

The invention further relates to a method to identify gene targets of the 
polypeptides of the invention, comprising comparing the gene expression in a non- 
10 human transgenic animal that express a polypeptide of the invention with a non-human 
transgenic animal that does not express the same polypeptide, wherein genes identified 
to be modulated in association with expression of said polypeptide, and which are not 
modulated when the same polypeptide is not expressed, are potential targets of the 
polypeptide. 

1 5 The invention also relates to a method to identify gene targets of the 

polypeptides of the invention, or functional fragments thereof, comprising comparing 
the gene expression in a recombinant host cell containing a polypeptide of the present 
invention with gene expression of a host cell that does not express the polypeptide, 
wherein genes identified to be specifically expressed in association with expression of 

20 the polypeptide, are potential targets of the polypeptide. 

Furthermore, the invention relates to a method of treatment of an individual 
having a disorder comprising administering a therapeutically-effective amount of an 
agent that modulates the activity of the polypeptide of the invention. In a preferred 
embodiment, the disorders include, but are not limited to an autoimmune disease and 

25 the agent that modulates the activity of the polypeptide is an agonist. In another 
preferred embodiment, the disorders include, but are not limited to cancer, malaria, 
tuberculosis, and HIV infection and the agent that modulates the activity of the 
polypeptide is an antagonist. Administration of the agent can be performed by any 
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suitable method, including but not limited to orally, intravenously, intramuscularly, 
subcutaneously, topically, rectally, or by inhalation. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The patent or application file contains at least one drawing executed in color. 
5 Copies of this patent or patent application publication with color drawing(s) will be 
provided by the Office upon request and payment of the necessary fee. 

Figure 1 shows the scheme of N15tg RAG-2" /_ DP thymocyte interactions with 
selecting versus nonselecting thymic stromal cells. In the presence of H-2 b class I MHC 
bearing stromal elements, the exogenously administered VSV8 peptide binds Kb 

10 molecules forming a pMHC complex (step 1). DP thymocytes then recognize this 

pMHC complex in the N15 TCRtg RAG-2"'" H-2 b mice via their TCR (step 2). Immune 
recognition triggers a TCR signal (step 3) resulting in the induction by the specific 
VSV8/Kb pMHC ligand of negative selection in those thymocytes (step 4). In contrast, 
in the N15 TCRtg RAG-2 7 ' H-2 d mice, there is no N15 TCR recognition of endogenous 

15 peptides bound to K d or other H-2 d molecules. Hence, no TCR signal occurs and no 
induction of negative selection. The clonotypic TCR ap heterodimer and associated 
CD3sy, CD3s8 and dimers are schematically represented. RNAs were derived from 
these two cell sorter-purified DP thymocyte populations and used for RDA. 

Figure 2 shows the multiple sequence alignment with IkBNS (SEQ ID NO: 4). 

20 The full sequence of IkBNS is shown. Numbering refers to IkBNS. The ankyrin 
domains of IkBNS are boxed and labeled from A to G. Secondary structure (ss) 
predictions for IkBNS are shown above the alignment with the inner helix of the 
ankyrin repeat core shown in gray, and the outer in blue. Secondary structure motifs of 
IkBcc were obtained from pdb 1NFI (Jacobs and Harrison, 1998) and are shown below 

25 the alignment. Sequences were aligned using the program ClustalX (Thompson et aL, 
1997), and the secondary structure prediction of IkBNS was determined using PSI- 
PRED (Jones, 1999). A dendrogram of the figure is displayed with IkBoc as the root and 
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was derived using the Neighbor) oining method (Saitou and Nei, 1987). Amino acid 
positions with identities or similarities in 5 or more of the 6 proteins aligned are 
highlighted in black with yellow letters. Amino acid positions with identities or 
similarities in four or more of the 6 proteins are highlighted in gray. For this analysis, 
5 V/L/I, S/T, N/Q, D/E, K/R and W/F are considered equal. Residues shown are: for 
human IkBoc aa 66-287, for human Bcl-3 aa 31-278, for murine IkB( aa 292-629, for 
human pi 05 aa 522-756 and for murine pi 00 aa 467-705. 

Figure 3 A is an RNA blot analysis of IkBNS expression in mouse tissues. A 
mouse multiple tissue Northern was probed with random-prime labeled IkBNS (upper 

10 panel) or p-actin control probe (lower panel). Each lane contains 2 |ig of polyA+ 

mRNA. The IkBNS blot was exposed for 4 days; the P-actin blot was exposed for 4 h. 
The position of a 2.0 kb marker is indicated. 

Figure 3B is an RNA blot analysis of IkBNS expression in thymii of variously 
treated animals. 10 \ig of total thymic RNA was run in each lane. RNA was isolated 

15 from the thymus of untreated N15tg RAG-2" A H- 2d animals, N15tg RAG-2' 7 " H-2 b 

animals 1 h after i.v. injection of 24 [ig VSV8, untreated TCRCyt5CC7-I-RAG-2 A H-2 a 
animals, TCRCyt5CC7-I-RAG-2 A H-2 a animals 1 h after i.v. injection of 24 |ig PCC 
peptide, untreated C57BL/6 animals and C57BL/6 animals 1 and 2 h after injection of 
0.5 mg dexamethasone or 500 rads whole body y-irradiation. 

20 Figure 3C is an RNA blot analysis of IkBNS expression in fetal thymic organ 

cultures (FTOC). 4|ig of total thymic RNA isolated from N15tg RAG-2' 7 " H-2 b FTOC 
was run in each lane. RNAs were prepared from FTOC after 2 h incubation with 10 |iM 
of the indicated peptides. RNAs were run on agarose/formaldehyde gels, transferred to 
nylon membranes and probed first with randomprimed labeled IkBNS (upper panel) and 

25 then with random-primed labeled GAPDH (lower panel) as a loading control. The 
position of the 18S ribosomal RNA band is indicated. 

Figure 4 demonstrates that IkBNS inhibits NF-KB-induced expression of 
luciferase in transfected Cos-7 cells. Cos- 7 cells were cotransfected with pRL-null and 
(kB3) NF-KB/luciferase constructs plus empty vector or, alternatively, cotransfected 
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with pRL-null and (kB3) NF-KB/luciferase constructs plus IkBNS or iKBa constructs. 
24 h after transfection, PMA was added to induce NF-kB activity. After 7 h, Cos-7 cells 
were lysed and luciferase activity determined. The relative light units (RLU) are shown 
x 10" 3 on the ordinate. One representative experiment out of 5 is shown. 
5 Figure 5 shows the inhibition of NF-kB EMS As by IkBNS. Nuclear thymic 

ly sates were prepared from N15tg RAG-2" 7 " H-2 b animals untreated (Control) or 1 h after 
i.v. injection of 24 |ag VSV8 (VSV8) and assayed by EMS A with no additions (Control 
and -), 1 Hg of GST protein, 1 \xg of GST-IkBNS, or 1 \xg GST-IkBcc. Also added were 
a 100 fold excess of cold NF-kB probe or cold AP-1 probe. The upper panel uses an 
10 NF-kB probe and the lower panel an AP-1 probe. A probe only lane (Probe) without 
lysate is shown. 

Figure 6A is an analysis of the interaction of GST-IkBNS or GST-IkBoc with 
NF-kB components in vivo and in vitro. Nuclear and cytosolic thymic lysates were 
prepared from uninjected N15 TCRtg RAG-2" A H-2 b mice or mice 10, 30 or 60 min after 
15 VSV8 peptide injection. Western blot analysis was performed on cytosolic and nuclear 
lysates for p50, p65 and RelB NF-kB proteins. The same amount of protein was added 
to each lane. 

Figure 6B is further analysis of the interaction of GST-IkBNS or GST-IkBoc 
with NF-kB components in vivo and in vitro. Cytosolic and nuclear fractions of thymic 

20 lysates were incubated with GST (Ctl), GST-IkBNS or GST-IkBcc proteins bound to 
beads and the interacting proteins identified by Western blot analysis with the indicated 
subunit-specific antibodies following SDS-PAGE. 

Figure 6C is an analysis of the interaction of IkBNS and IkBcx with in vitro 
translated p50 and p65 proteins. [ 35 S]Radiolabeled murine NF-kB proteins were 

25 produced by in vitro translation and incubated individually or in combination with 5 |ag 
GST, GST-IkBNS or GST-IkBcc coupled to beads. Associated proteins were identified 
by autoradiography after SDS-PAGE. In each pair of lanes incubated with an NF-kB 
protein(s), the left lane shows binding to the GST control beads and the right lane shows 
interaction with GST-IkBNS- (upper panel) or GST-IkBcc- (lower panel) beads. 
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Figure 7 A demonstrates that IkBNS antisense oligonucleotides block VSV8- 
induced negative selection in N15tg RAG-2"'" H-2 b FTOC. FACS analysis of DP 
thymocytes in FTOC from N15tg RAG-2 _/ " H- 2b mice after no addition (None), VSV8 
peptide addition for 4 h (VSV8), or VSV8 addition for 4 h following 18 h of antisense 
5 (AS + VSV8) or sense (S + VSV8) oligonucleotide treatment. The expression of CD4 
(Y axis) or CD8 (X axis) was analyzed by two-color flow cytometry after gating on live 
cells. The percentage of DP thymocytes is indicated. 

Figure 7B represents the number of cells for each thymic lobe (Total), the DP 
population (DP) and the CD8 SP population (CD8 SP). One representative experiment 
10 of five is shown. 

Figure 8 is a model of the role of IkBNS in negative selection. Green arrows 
indicate TCR pathways in DP thymocytes induced by positively selecting stimuli while 
red arrows denote those of negatively selecting stimuli. 

Figures 9A-9I are the human genomic sequence of IkBNS (SEQ ID NO: 1). The 
15 exons are indicated by shading. Exon I spans 1024-121 5bp; exon II spans 1297- 

1401bp; exon III spans 1589-1734bp; exon IV spans 1858-191 lbp; exon V spans 2001- 
2147bp; exon VI spans 7637-7753bp; exon VII spans 8073-8237bp; and exon VIII 
spans 9515-9568bp. 

Figure 10 is the mouse IkBNS cDNA sequence (SEQ ID NO: 3). The open 
20 reading frame (ORF) spans 473-1456 bp, and is indicated by shading. 

Figure 1 1 is the sequence alignment of human (SEQ ID NO: 2) and mouse (SEQ 
ID NO: 4) IkBNS protein sequences. In the center line between human and mouse 
sequences, amino acid identity is denoted by the identical amino acid; conservative 
amino acids are denoted by "+"; stretched gaps in the amino acid sequence are denoted 
25 by non-identity between amino acid sequences are denoted as blank spaces. 

The foregoing and other objects, features and advantages of the invention will be 
apparent from the following more particular description of preferred embodiments of 
the invention. 
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DETAILED DESCRIPTION OF THE INVENTION 

Apoptosis is a morphologically stereotyped form of programmed cell death 
utilized by metazoan organisms during normal development as well as for homeostasis 
(Kerr et al, 1972; Horvitz et al, 1982; Steller, 1995; Vaux and Korsmeyer, 1999). 
5 Excess cells at each stage of organogenesis are eliminated by this mechanism. Within 
the hematopoetic system this phenomenon is well described (Cohen et al, 1992). 

In T lymphoid development, ocp T cell receptors (TCR) are generated by a 
stochastic process of rearrangement of variable gene segments. Through combinatorial 
diversity, billions of different TCRs are generated ensuring protection against a myriad 

10 of infectious agents without a requirement to encode all TCR specificities in the 

germline (Davis and Bjorkman, 1988). One potential hazard of this diversity-generating 
mechanism is the creation of self-reactive TCRs. Thus, T cell repertoire generation must 
be carefully regulated through a developmental selection program termed negative 
selection (von Boehmer, 1991). Those TCRs that recognize self-MHC molecules and 

1 5 their associated peptides (pMHC) too strongly are eliminated by negative selection 
whereas those thymocytes with low affinity for self-MHC undergo maturation and are 
exported to the periphery in a process termed positive selection (Goldrath and Bevan, 
1999). These selection processes act predominantly at the level of immature 
CD4+CD8+ double positive (DP) thymocytes and appear to be biochemically distinct 

20 (Sebzda et al, 1999). TCRs and peptide-binding MHC molecules have co-evolved to 
interact with one another such that a given TCR can distinguish subtle differences 
between self and foreign (i.e., infectious) peptides bound to an MHC molecule 
(Reinherz et al, 1999). The TCR-related selection processes are complex, being 
dependent not only on the monomeric affinity of the TCR-pMHC interaction (Alam et 

25 al, 1996) but the interplay between co-stimulation, co-receptor ligation and 
TCR/pMHC affinity (Sebzda et al, 1999). 

There are two major pathways of cell execution after the receipt of a death 
signal: the caspase enzymatic cascade and mitochondrial dysfunction (Gross et al., 
1999). Bcl2 family members play a pivotal role in deciding whether a cell will live or 
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die through their activities executed at the level of mitochondrial membrane 
permeability. However, over-expression of Bcl-2 and Bcl-xL in transgenic mice does 
not affect TCR-induced cell death suggesting that TCR-triggered death depends on 
caspase activity (Rathmell and Thompson, 1999). In fact, TCR triggering by "self- 
5 reactive" pMHC ligands during thymic negative selection activates a caspase in DP 
thymocytes (Clayton et al, 1997). Inhibition of this enzymatic activity with caspase 3- 
like inhibitors prevents antigen-induced death of DP thymocytes in fetal thymic organ 
culture (FTOC) from TCR transgenic mice as well as apoptosis induced by anti-TCR 
mAb crosslinking and corticosteroids in FTOC of normal C57BL/6 mice (Clayton et al, 

10 1997). Hence a caspase mediates immature thymocyte susceptibility to cell death. 

Consistent with this data, gene deletion studies have shown that caspases 3, 8 and 9 are 
required for cell death during mammalian development (Vaux and Korsmeyer, 1999). 

In mammalian cells, certain apoptotic pathways require gene transcription 
whereas others do not. Examples of the former are dexamethasone-induced or 

15 superantigen-induced lymphocyte death while apoptosis triggered by CD95 is an 

example of the latter (D'Adamio et al, 1992; Vaux and Korsmeyer, 1999). Since TCR 
triggering leads to the induction of gene expression, transcription factors would also be 
expected to play a role in thymocyte development and selection. The transcription factor 
IRF-1 appears to be involved in both positive and negative selection (Penninger et al, 

20 1997) while nur77, a second transcription factor, is involved in negative selection 

processes (Liu et al, 1994; Amsen et al, 1999). Deletion of the nur77 gene in vivo (Lee 
et al, 1995) did not result in impaired thymic selection implying that other pathways 
may be operative in negative selection. In this regard, NF-kB transcription factors have 
been implicated in providing both survival and death signals (Ghosh et al, 1998) and 

25 are expressed at high levels in the thymus in a constitutively active state (Ivanov and 
Ceredig, 1992). 

Negative selection eliminates thymocytes bearing autoreactive T cell receptors 
(TCR) via an apoptotic mechanism. To identify additional key genes expressed in 
thymocytes undergoing peptide-triggered selection, a TCR transgenic mouse system was 



1062.2002-004 

t 



-13- 

utilized. Antigen-induced negative selection was induced (Ghendler et al, 1998) and 
subtractive cDNA cloning performed using representational difference analysis (RDA) 
(Lisitsyn et al, 1993; Hubank and Schatz, 1994). A subtracted library was prepared and 
a cDNA which encodes a novel functional inhibitor of NF- kB 5 termed IkBNS, was 
5 cloned. IkBNS, is rapidly expressed upon TCR-triggered but not dexamethasone- or y- 
irradiation-stimulated thymocyte death. The predicted protein contains 7 ankyrin repeats 
and is homologous to known IkB family members. In class I and class II MHC- 
restricted TCR transgenic mice, transcription of IkBNS is stimulated by peptides that 
trigger negative selection but not by those inducing positive selection (i.e., survival) or 

10 nonselecting peptides. IkBNS blocks transcription from NF-kB reporters, alters NF-kB 
electrophoretic mobility shifts and interacts with NF-kB proteins in thymic nuclear 
lysates following TCR stimulation. Inhibition of IkBNS translation blocks antigen- 
induced negative selection in fetal thymic organ culture. 

NF-kB transcription factors are a highly conserved family of dimeric proteins 

15 found in many cell types. These inducible transcription factors activate various genes in 
response to proinflammatory and noxious stimuli (Ghosh et al, 1998; Baeuerle, 1998; 
Karin and Ben- Neriah, 2000). The remarkable ability of NF-kB proteins to respond 
quickly to cell surface perturbations is a function of the special regulation of NF-kB 
transcriptional activity by inhibitors called IkB proteins. NF-kB exists in the cytoplasm 

20 complexed with an IkB protein; upon the cell receiving a signal to activate NF-kB, IkB 
is phosphorylated and degraded via the proteasome. The free NF-kB then enters the 
nucleus and activates gene transcription by binding particular DNA sites (Ghosh et al, 
1998; Karin and Ben-Neriah, 2000). Seven IkB proteins termed IkB a, M30, IkBe, 
IkBy, Bcl-3, MAIL (IkBC) (Kitamura et al, 2000; Yamazaki et al, 2001) and 

25 Interleukin-1 -inducible nuclear ankyrin-repeat protein (INAP) (Haruta et al, 2001) have 
been identified in mammalian cells (Ghosh et al y 1998). All seven proteins contain 
ankyrin domains which mediate protein-protein interaction with NF-kB homo- and 
heterodimers. IkBoc is the best characterized member of this family and functions by 
blocking the nuclear localization signal and DNA binding of NF-kB proteins thus 
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preventing nuclear entry and NF-KB-induced gene transcription until appropriate cell 
stimulation (Ghosh et al, 1998; Karin and Ben-Neriah, 2000). The IkB family members 
may function by different mechanisms. For example, IkBoc, IkBP and IkBg bind 
strongly to p65- or c-Rel-containing dimers and Bcl-3 more strongly interacts with p52 
5 and p50 homodimers (Ghosh et al, 1998). Whereas IkBoc maintains the balance of 
cytoplasmic vs nuclear NF-kB by a dynamic shuttling process, iKBp and IkB€ act to 
sequester NF-kB proteins in the cytoplasm (Johnson et al, 1999; Huang et al, 2000; 
Tarn and Sen, 2001). Bcl-3 instead is a nuclear protein and acts to increase transcription 
of genes with NF-KB-sensitive promoters perhaps by removing the inhibitory p50 or 

10 p52 homodimers from DNA, allowing active p65/p50 heterodimers access to NF-kB 
sites. Furthermore, Bcl-3 contains transactivation domains and may form activating 
complexes with p50 and p52 (Bours et al, 1993; Fujita et al, 1993). Other nuclear co- 
regulators bind Bcl-3 suggesting that Bcl-3 may act as a bridging factor between other 
proteins and p50 or p52 (Dechend et al, 1999). IicBy has a limited distribution and is as 

1 5 yet not well characterized (Ghosh et al, 1 998). 

The role of the NF-kB/IkB transcriptional regulatory pathways in T cell 
development is currently unclear. Gene knockouts of individual NF-kB family members 
do not demonstrate marked defects in lymphocyte development, but this lack of a 
thymic phenotype may result from functional redundancy (Ghosh et al, 1998) since 

20 double knockouts demonstrate more severe phenotypes (Gerondakis et al, 1999). For 
example, in p50/RelB and p50/p52 double knockouts, thymic atrophy was observed 
(Franzoso et al, 1997; Weih et al, 1997). In transgenic mice expressing a dominant 
negative IkBcc in the thymus, thymocytes developed normally although partial defects in 
proliferation were observed (Hettmann et al, 1999) particularly in CD44- CD25-DN 

25 thymocytes (Voll et al, 2000). Analysis of IKK0 negative fetal liver stem cells in 
reconstitution experiments suggests a similar proliferation defect in thymocytes with 
altered NF- kB activity (Senftleben et al, 2001). IKKp is a catalytic subunit of the IkB 
kinase complex that phosphorylates IkB proteins in the first step of the proteasome 
degradation pathway. Thus IKK(5 negative cells should be functionally equivalent to 
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cells containing super-repressor IkBcc. The current consensus seems to be that NF-kB 
proteins play a role in lymphocyte development and affect survival of developing 
thymocytes through effects on proliferation, protection from TNF- induced apoptosis 
and regulation of anti-apoptotic genes such as bcl-xL (Hettmann et al, 1999; Senftleben 
5 et aU 2001 ; Voll et al, 2000). 

The distribution of IkBNS expression suggests a more restricted role for this 
protein in NF-kB regulation during T cell development. Furthermore, the correlation of 
IkBNS induction with TCR-induced negative selection signals and the ability of IkBNS 
antisense oligonucleotides to block negative selection implies a very specific function in 

10 determining death/survival of immature thymocytes. Previous analyses of NF-kB 
inhibition by expression of a super-repressor IkBoc which lacks IKK phosphorylation 
sites have suggested that NF-kB inhibition affects the proliferation of thymocytes during 
development but does not disrupt development itself (Hettmann et al, 1999; Bakker et 
al, 1999; Ferreira et ai, 1999; Voll et al, 2000). An effect on apoptosis was also noted 

15 but while adeno virus-mediated expression of super-repressor IkBcc in FTOC resulted in 
increased apoptosis (Bakker et ai, 1999), a superrepressor IkBcc transgene expressed 
under the control of a CD2 promoter resulted in a decrease in anti-CD3-mediated 
thymocyte death (Hettmann et ai, 1999). Nonetheless, the thymocyte populations 
affected by NF-kB inhibition appear to be those subpopulations expressing the highest 

20 levels of NF-kB activity (Voll et al, 2000). 

If NF-kB provides survival signals to developing thymocytes, then induction of 
an inhibitor of this activity would clearly result in cell death. However, the IkBcc 
superrepressor transgene prevented anti-CD3e-triggered cell death (Hettmann et al, 
1999). Thus simply inhibiting NF-kB activation does not appear to result in TCR- 

25 triggered thymocyte apoptosis, at least as stimulated by anti-CD3e mAb crosslinking. 
To the contrary, these findings suggest that NF-kB activation is required for TCR- 
triggered thymocyte apoptosis. One explanation for this paradox is that IkBNS may 
behave in a different manner than IkBcc. IkBNS, like Bcl-3 with which it is most 
structurally homologous (Figure 2), may bind to p50 and p52 homodimers. As described 
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above, these NF-kB complexes act as inhibitors of transcription by blocking NF-kB 
sites on DNA. IkBNS may remove these inactive complexes from DNA and redirect 
TCR-triggered gene transcription (Franzoso et al, 1992; Bours et al, 1993; Fujita et al, 
1993). Alternatively, IkBNS may also act as a bridging component between NF-kB 
5 dimers and other proteins. One model for the role of IkBNS in negative selection is 
offered in Figure 8. Upon receipt of a positively selecting signal through the TCR, IKK 
is activated. IkBcc is then degraded allowing NF-kB to translocate to the nucleus and 
induce transcription of genes required for survival (as indicated by the green arrows). 
However, upon receipt of a negatively selecting signal, transcription of IkBNS is 

10 induced in addition (as indicated by the red arrows). We believe that IkBNS has two 
potential modes of action. IkBNS enters the nucleus, binds to NF- kB/DNA complexes 
and removes these active complexes from the DNA thereby blocking transcription of the 
"survival genes" (mechanism 1). A second possibility is that IkBNS binds NF-kB 
dimers and redirects the active complex to a new site resulting in the transcription of 

15 genes which execute the cell (mechanism 2). IkBNS may induce these "execution" 
genes on its own or require an additional factor(s) present in DP thymocytes. 

With respect to the panel of peptides tested herein, a similar pattern of mRNA 
induction for IkBNS and nur77 was observed, a transcription factor also linked to 
negative selection processes (Liu et al, 1994; Amsen et al, 1999). The similar pattern 

20 of gene expression suggests that the same TCR signal may induce nur77 and IkBNS as a 
consequence of activation of a very early upstream gene which regulates both or that 
TCR-triggered post-translational modification of a protein(s) then activates these two 
genes directly. From characterization of the RDA products generated herein, nur77 and 
IkBNS account for the vast majority of differentially expressed clones. For the current 

25 analysis Nl 5tg RAG-2" 7 ' TCR H-2 b transgenic mice bearing a single TCR recognizing 
the VSV8 peptide complexed with the class I MHC Kb molecule were utilized. A 
strength of this system is the accessibility of differentially selecting peptide ligands and 
the detailed knowledge available concerning the affects of small structural alterations of 
peptide sidechains on the signal transmitted to the T cell or thymocyte (Ghendler et al, 
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1998; Sasada et al, 2000). This information allows the correlation of the induction of 
IkBNS with those peptide stimuli which trigger negative selection in N15tg RAG-2" 7 ' 
TCR H-2 b transgenic mice. Similarly, in the class II system, TCRCyt5CC7-I-RAG-2' A 
H-2 a mice injected with the negatively selecting PCC peptide also express IkBNS 
5 message in the thymus within 1 h of treatment. Thus IkBNS serves as a molecular 
marker of negative selection in both class I and class II TCR transgenic mice. In wild 
type C57BL/6 mice, IkBNS is induced by anti-CD3e injection, indicating a role for 
IkBNS in negative selection of non-transgenic mice as well. That IkBNS gene 
expression is strongly linked to TCR-triggered negative selection and not other forms of 
10 apoptosis is clear from the observation that y -irradiation and dexamethasone fail to 
induce IkBNS message in thymocytes. Transcription of two recently identified iKB-like 
genes is also induced by cell surface triggering; MAIL (IkBQ is induced by LPS 
(Kitamura et al, 2000; Yamazaki et al, 2001) and INAP is induced by IL-1 (Haruta et 
al, 2001). 

1 5 That TCR triggering differentially induces transcription of genes and 

biochemical signaling events predicated on the extent of TCR-crosslinking or ligand 
affinity is well documented (Meuer et al, 1984; Fujita et al, 1986; Sloan-Lancaster et 
al, 1994; Gong et al, 2001). Correlative studies suggest that the TCR signal required to 
induce positive selection necessitates a weaker TCR-pMHC interaction than that 

20 required for negative selection (Alam et al, 1996). Thus, exceeding a particular affinity 
threshold leads to negative selection. In this respect, IkBNS appears to be a qualitative 
transducer of divergent TCR ligation signals: those signals resulting in negative 
selection induce transcription of this gene while nonselecting or positively selecting 
peptides for which the TCR has less affinity do not. 

25 As described herein, novel nucleic acid molecules have been identified from 

thymocytes triggered to undergo negative selection. These nucleic acid molecules, or a 
subset thereof, can serve as markers to identify and characterize negative selection of T 
cells. 
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In one embodiment, the nucleic acid molecules consist of a nucleotide sequence 
selected from the group consisting of SEQ ID NOS: 1 and 3 , the complement of SEQ ID 
NOS: 1 and 3, nucleic acid molecules that encode SEQ ID NOS: 2 and 4, and a nucleic 
acid molecule consisting of exons I to XIII of SEQ ID NO: 1 in a contiguous sequence, 
5 or combinations or permutations of exons I to XIII of SEQ ID NO: 1 in a contiguous 
sequence. In another embodiment, the nucleic acid molecules comprise a nucleotide 
sequence selected from the group consisting of SEQ ID NO: 3, the complement of SEQ 
ID NO: 3, and nucleic acid molecules that encode SEQ ID NOS: 2 and 4, and a nucleic 
acid molecule consisting of exons I to XIII of SEQ ID NO: 1 in a contiguous sequence, 

10 or combinations or permutations of exons I to XIII of SEQ ID NO: 1 in a contiguous 
sequence. In another embodiment, the nucleic acid molecule hybridizes under high 
stringency conditions to a nucleotide sequence selected from the group consisting of 
SEQ ID NOS: 1 and 3, the complement of SEQ ID NOS: 1 and 3, and nucleic acid 
molecules that encode SEQ ID NOS: 2 and 4. In a further embodiment, the nucleic acid 

1 5 molecule of the invention comprises a nucleotide sequence which is greater than about 
75 percent, and more preferably greater than about 80 percent, and even more preferably 
greater than about 90 percent, identical to a nucleotide sequence selected from the group 
consisting of SEQ ID NOS: 1 and 3, the complement of SEQ ID NOS: 1 and 3, and 
nucleic acid molecules that encode SEQ ID NOS: 2 and 4. In a preferred embodiment, 

20 the nucleic acid molecule which hybridizes under conditions of high stringency to a 
nucleotide sequence selected from the group consisting of SEQ ID NOS: 1 and 3, the 
complement of SEQ ID NOS: 1 and 3, and the nucleic acid molecules that encode SEQ 
ID NOS: 2 and 4 is isolated from mammalian tissue. 

The invention further relates to an isolated portion of any of SEQ ID NOS: 1 and 

25 3, the complement of SEQ ID NOS: 1 and 3, and nucleic acid molecules that encode 
SEQ ID NOS: 2 and 4, which portion is sufficient in length to distinctly characterize the 
sequence. For example, the isolated portion can be from about 15 to about 25 
nucleotides in length, nucleotides in length, and more preferably from about 25 to about 
40 nucleotides in length. 
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As appropriate, nucleic acid molecules of the present invention can be RNA, for 
example, mRNA, or DNA, such as cDNA and genomic DNA. DNA molecules can be 
double-stranded or single-stranded; single stranded RNA or DNA can be the coding or 
sense strand, or the non-coding or antisense, strand. Preferably, the nucleic acid 
5 molecule comprises at least about 10 nucleotides, more preferably at least about 50 
nucleotides, and even more preferably at least about 200 nucleotides. The nucleic acid 
molecule can include all or a portion of the coding sequence of a gene and can further 
comprise additional non-coding sequences such as introns and non-coding 3' and 5' 
sequences (including regulatory sequences, for example). Additionally, the nucleic acid 

10 molecule can be fused to a marker sequence, for example, a sequence which encodes a 
polypeptide to assist in isolation or purification of the polypeptide. Such sequences 
include, but are not limited to, those which encode a glutathione-S-transferase (GST) 
fusion protein and hemophilus influenza hemaglutinin tag (HA). 

As used herein, an "isolated" gene or nucleic acid molecule is intended to mean 

15 a gene or nucleic acid molecule which is not flanked by nucleic acid molecules which 
normally (in nature) flank the gene or nucleic acid molecule (such as in genomic 
sequences) and/or has been completely or partially purified from other transcribed 
sequences (as in a cDNA or RNA library). For example, an isolated nucleic acid of the 
invention may be substantially isolated with respect to the complex cellular milieu in 

20 which it naturally occurs. In some instances, the isolated material will form part of a 
composition (for example, a crude extract containing other substances), buffer system or 
reagent mix. In other circumstance, the material may be purified to essential 
homogeneity, for example as determined by PAGE or column chromatography such as 
HPLC. Preferably, an isolated nucleic acid comprises at least about 50, 80 or 90 percent 

25 (on a molar basis) of all macromolecular species present. Thus, an isolated gene or 
nucleic acid molecule can include a gene or nucleic acid molecule which is synthesized 
chemically or by recombinant means. Thus, recombinant DNA contained in a vector are 
included in the definition of "isolated" as used herein. Also, isolated nucleic acid 
molecules include recombinant DNA molecules in heterologous host cells, as well as 
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partially or substantially purified DNA molecules in solution. In vivo and in vitro RNA 
transcripts of the DNA molecules of the present invention are also encompassed by 
"isolated" nucleic acid molecules. Such isolated nucleic acid molecules are useful in the 
manufacture of the encoded protein, as probes for isolating homologous sequences (e.g., 
5 from other mammalian species), for gene mapping (e.g., by in situ hybridization with 
chromosomes), or for detecting expression of the gene in tissue (e.g., human tissue such 
as brain tissue), such as by Northern blot analysis. 

The invention described herein also relates to fragments or portions of the 
isolated nucleic acid molecules described above. The term "fragment" is intended to 

10 encompass a portion of a nucleic acid molecule described herein which is from at least 
about 25 contiguous nucleotides to at least about 40 contiguous nucleotides or longer in 
length; such fragments are useful as probes, e.g., for diagnostic methods and also as 
primers. Particularly preferred primers and probes selectively hybridize to nucleic acid 
molecules comprising the nucleotide sequences of any of SEQ ID NOS: 1 and 3, the 

15 complement of SEQ ID NOS: 1 and 3, and nucleic acid molecules that encode SEQ ID 
NOS: 2 and 4. For example, fragments which encode antigenic proteins or polypeptides 
described herein are useful. 

The invention also pertains to nucleic acid molecules which hybridize under 
high stringency hybridization conditions (e.g., for selective hybridization) to a 

20 nucleotide sequence described herein. Hybridization probes are oligonucleotides which 
bind in a base-specific manner to a complementary strand of nucleic acid. Such probes 
include polyamide nucleic acids, as described in Nielsen et aL, Science 254, 1497-1500 
(1991). Appropriate stringency conditions are known to those skilled in the art or can 
be found in standard texts such as Current Protocols in Molecular Biology, John Wiley 

25 & Sons, N.Y. (1989), 6.3.1-6.3.6. For example, stringent hybridization conditions 

include a salt concentration of no more than 1 M and a temperature of at least 25 °C. In 
one embodiment, conditions of 5X SSPE (750 mM NaCl, 50 mM NaPhosphate, 5 mM 
EDTA, pH 7.4) and a temperature of 25-30°C, or equivalent conditions, are suitable for 
specific probe hybridizations. Equivalent conditions can be determined by varying one 
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or more of the parameters given as an example, as known in the art, while maintaining a 
similar degree of identity or similarity between the target nucleic acid molecule and the 
primer or probe used. Hybridizable nucleic acid molecules are useful as probes and 
primers for diagnostic applications. 
5 As used herein, the term "primer" refers to a single-stranded oligonucleotide 

which acts as a point of initiation of template-directed DNA synthesis under appropriate 
conditions (e.g., in the presence of four different nucleoside triphosphates and an agent 
for polymerization, such as, DNA or RNA polymerase or reverse transcriptase) in an 
appropriate buffer and at a suitable temperature. The appropriate length of a primer 

10 depends on the intended use of the primer, but typically ranges from 15 to 30 

nucleotides. Short primer molecules generally require cooler temperatures to form 
sufficiently stable hybrid complexes with the template. A primer need not reflect the 
exact sequence of the template, but must be sufficiently complementary to hybridize 
with a template. The term "primer site" refers to the area of the target DNA to which a 

15 primer hybridizes. The term "primer pair" refers to a set of primers including a 5' 

(upstream) primer that hybridizes with the 5' end of the DNA sequence to be amplified 
and a 3 f (downstream) primer that hybridizes with the complement of the 3' end of the 
sequence to be amplified. 

Accordingly, the invention pertains to nucleic acid molecules which have a 

20 substantial identity with the nucleic acid molecules described herein; particularly 
preferred are nucleic acid molecules which have at least about 90%, and more 
preferably at least about 95% identity with nucleic acid molecules described herein. 
Thus, DNA molecules which comprise a sequence which is different from the naturally- 
occurring nucleic acid molecule but which, due to the degeneracy of the genetic code, 

25 encode the same protein or polypeptide are the subject of this invention. The invention 
also encompasses variations of the nucleic acid molecules of the invention, such as 
those encoding portions, analogues or derivatives of the encoded protein or polypeptide. 
Such variations can be naturally-occurring, such as in the case of allelic variation, or 
non-naturally-occurring, such as those induced by various mutagens and mutagenic 
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processes. Variations include, but are not limited to, addition, deletion and substitution 
of one or more nucleotides which can result in conservative or non-conservative amino 
acid changes, including additions and deletions. Preferably, the nucleotide or amino 
acid variations are silent; that is, they do not alter the characteristics or activity of the 
5 encoded protein or polypeptide. As used herein, activities of the encoded protein or 
polypeptide include, but are not limited to, catalytic activity, binding function, antigenic 
function and oligomerization function. 

The nucleotide sequences of the nucleic acid molecules described herein, e.g., 
SEQ ID NOS: 1 and 3, the complement of SEQ ID NOS: 1 and 3, and nucleic acid 

10 molecules that encode SEQ ID NOS: 2 and 4, can be amplified by methods known in 
the art. For example, this can be accomplished by e.g., PCR. See generally PCR 
Technology: Principles and Applications for DNA Amplification (ed. H. A. Erlich, 
Freeman Press, NY, NY, 1992); PCR Protocols: A Guide to Methods and Applications 
(eds. Innis, et al., Academic Press, San Diego, CA, 1990); Mattila et al., Nucleic Acids 

15 Res. 19, 4967 (1991); Eckert et al, PCR Methods and Applications 1, 17 (1991); PCR 
(eds. McPherson et al, IRL Press, Oxford); and U.S. Patent 4,683,202. 

Other suitable amplification methods include the ligase chain reaction (LCR) 
(see Wu and Wallace, Genomics 4, 560 (1989), Landegren et al., Science 241, 1077 
(1988), transcription amplification (Kwoh et al., Proc. Natl. Acad. Sci. USA 86, 1 173 

20 (1989)), and self-sustained sequence replication (Guatelli et al., Proc. Nat. Acad. Sci. 
USA, 87, 1874 (1990)) and nucleic acid based sequence amplification (NASBA). The 
latter two amplification methods involve isothermal reactions based on isothermal 
transcription, which produce both single stranded RNA (ssRNA) and double stranded 
DNA (dsDNA) as the amplification products in a ratio of about 30 or 100 to 1, 

25 respectively. 

The amplified DNA can be radiolabeled and used as a probe for screening a 
cDNA library derived from thymus tissue, e.g., human thymus tissue, mRNA in Xzap 
express, ZIPLOX or other suitable vector. Corresponding clones can be isolated, DNA 
can obtained following in vivo excision, and the cloned insert can be sequenced in either 
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or both orientations by art recognized methods, to identify the correct reading frame 
encoding a protein of the appropriate molecular weight. For example, the direct 
analysis of the nucleotide sequence of nucleic acid molecules of the present invention 
can be accomplished using either the dideoxy chain termination method or the Maxam - 
5 Gilbert method (see Sambrook et al, Molecular Cloning, A Laboratory Manual (2nd 
Ed., CSHP, New York 1989); Zyskind et al, Recombinant DNA Laboratory Manual, 
(Acad. Press, 1988)). Using these or similar methods, the protein(s) and the DNA 
encoding the protein can be isolated, sequenced and further characterized. 

With respect to protein or polypeptide identification, bands identified by gel 

10 analysis can be isolated and purified by HPLC, and the resulting purified protein can be 
sequenced. Alternatively, the purified protein can be enzymatically digested by methods 
known in the art to produce polypeptide fragments which can be sequenced. The 
sequencing can be performed, for example, by the methods of Wilm et al {Nature 
579(6564):466-469 (1996)). The protein may be isolated by conventional means of 

1 5 protein biochemistry and purification to obtain a substantially pure product, i.e., 80, 95 
or 99% free of cell component contaminants, as described in Jacoby, Methods in 
Enzymology Volume 104, Academic Press, New York (1984); Scopes, Protein 
Purification, Principles and Practice, 2nd Edition, Springer- Verlag, New York (1987); 
and Deutscher (ed), Guide to Protein Purification, Methods in Enzymology, Vol. 182 

20 (1990). If the protein is secreted, it can be isolated from the supernatant in which the 
host cell is grown. If not secreted, the protein can be isolated from a lysate of the host 
cells. 

In addition to substantially full-length polypeptides encoded by nucleic acid 
molecules described herein, the present invention includes biologically active fragments 
25 of the polypeptides, or analogs thereof, including organic molecules which simulate the 
interactions of the polypeptides. Biologically active fragments include any portion of 
the full-length polypeptide which confers a biological function on the variant gene 
product, including ligand binding, and antibody binding. Ligand binding includes 
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binding by nucleic acids, proteins or polypeptides, small biologically active molecules, 
or large cellular structures. 

This invention also pertains to an isolated protein or polypeptide encoded by the 
nucleic acid molecules of the invention. The encoded proteins or polypeptides of the 
5 invention can be partially or substantially purified (e.g., purified to homogeneity), 
and/or are substantially free of other proteins. According to the invention, the amino 
acid sequence of the polypeptide can be that of the naturally-occurring protein or can 
comprise alterations therein. Such alterations include conservative or non-conservative 
amino acid substitutions, additions and deletions of one or more amino acids; however, 

10 such alterations should preserve at least one activity of the encoded protein or 

polypeptide, i.e., the altered or mutant protein should be an active derivative of the 
naturally-occurring protein. For example, the mutation(s) can preferably preserve the 
three dimensional configuration of the binding and/or the ankyrin repeats of the native 
protein. The presence or absence of biological activity or activities can be determined 

15 by various functional assays as described herein. Moreover, amino acids which are 
essential for the function of the encoded protein or polypeptide can be identified by 
methods known in the art. Particularly useful methods include identification of 
conserved amino acids in the family or subfamily, site-directed mutagenesis and 
alanine-scanning mutagenesis (for example, Cunningham and Wells, Science 244:1081- 

20 1085 (1989)), crystallization and nuclear magnetic resonance. The altered polypeptides 
produced by these methods can be tested for particular biologic activities, including 
immunogenicity and antigenicity. 

Specifically, appropriate amino acid alterations can be made on the basis of 
several criteria, including hydrophobicity, basic or acidic character, charge, polarity, 

25 size, the presence or absence of a functional group (e.g., -SH or a glycosylation site), 
and aromatic character. Assignment of various amino acids to similar groups based on 
the properties above will be readily apparent to the skilled artisan; further appropriate 
amino acid changes can also be found in Bowie et al. {Science 247:1306-1310(1990)). 
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The encoded polypeptide can also be a fusion protein comprising all or a portion 
of the amino acid sequence fused to an additional component. Additional components, 
such as radioisotopes and antigenic tags, can be selected to assist in the isolation or 
purification of the polypeptide or to extend the half life of the polypeptide; for example, 
5 a hexahistidine tag would permit ready purification by nickel chromatography. 
Furthermore, polypeptides of the present invention can be progenitors of the active 
protein; progenitors are molecules which are cleaved to form an active molecule. 

Polypeptides described herein can be isolated from naturally-occurring sources, 
chemically synthesized or recombinantly produced. Polypeptides or proteins of the 

10 present invention can be used as a molecular weight marker on SDS-PAGE gels or on 
molecular sieve gel filtration columns using art-recognized methods. 

The invention also provides expression vectors containing a nucleic acid 
sequence described herein, operably linked to at least one regulatory sequence. Many 
such vectors are commercially available, and other suitable vectors can be readily 

1 5 prepared by the skilled artisan. "Operably linked" is intended to meant that the nucleic 
acid molecule is linked to a regulatory sequence in a manner which allows expression of 
the nucleic acid sequence. Regulatory sequences are art-recognized and are selected to 
produce the encoded polypeptide or protein. Accordingly, the term "regulatory 
sequence" includes promoters, enhancers, and other expression control elements which 

20 are described in Goeddel, Gene Expression Technology: Methods in Enzymology 185, 
Academic Press, San Diego, CA (1990). For example, the native regulatory sequences 
or regulatory sequences native to the transformed host cell can be employed. It should 
be understood that the design of the expression vector may depend on such factors as 
the choice of the host cell to be transformed and/or the type of protein desired to be 

25 expressed. For instance, the polypeptides of the present invention can be produced by 
ligating the cloned gene, or a portion thereof, into a vector suitable for expression in 
either prokaryotic cells, eukaryotic cells or both (see, for example, Broach, et al, 
Experimental Manipulation of Gene Expression, ed. M. Inouye (Academic Press, 1983) 
p. 83; Molecular Cloning: A Laboratory Manual, 2nd Ed., ed. Sambrook et al (Cold 
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Spring Harbor Laboratory Press, 1989) Chapters 16 and 17). Typically, expression 
constructs will contain one or more selectable markers, including, but not limited to, the 
gene that encodes dihydrofolate reductase and the genes that confer resistance to 
neomycin, tetracycline, ampicillin, chloramphenicol, kanamycin and streptomycin 
5 resistance. 

Prokaryotic and eukaryotic host cells transfected by the described vectors are 
also provided by this invention. For instance, cells which can be transfected with the 
vectors of the present invention include, but are not limited to, bacterial cells such as E. 
coli (e.g., E. coli K12 strains), Streptomyces, Pseudomonas, Serratia marcescens and 

10 Salmonella typhimurium, insect cells (baculovirus), including Drosophila, fungal cells, 
such as yeast cells, plant cells and mammalian cells, such as thymocytes, Chinese 
hamster ovary cells (CHO), and COS cells. 

Thus, a nucleic acid molecule described herein can be used to produce a 
recombinant form of the protein via microbial or eukaryotic cellular processes. Ligating 

15 the nucleic acid molecule into a gene construct, such as an expression vector, and 
transforming or transfecting into hosts, either eukaryotic (yeast, avian, insect, plant or 
mammalian) or prokaryotic (bacterial cells), are standard procedures used in producing 
other well known proteins. Similar procedures, or modifications thereof, can be 
employed to prepare recombinant proteins according to the present invention by 

20 microbial means or tissue-culture technology. Accordingly, the invention pertains to the 
production of encoded proteins or polypeptides by recombinant technology. 

The proteins and polypeptides of the present invention can be isolated or 
purified (e.g., to homogeneity) from recombinant cell culture by a variety of processes. 
These include, but are not limited to, anion or cation exchange chromatography, ethanol 

25 precipitation, affinity chromatography and high performance liquid chromatography 

(HPLC). The particular method used will depend upon the properties of the polypeptide 
and the selection of the host cell; appropriate methods will be readily apparent to those 
skilled in the art. 
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The present invention also relates to antibodies which bind a polypeptide or 
protein of the invention. For instance, polyclonal and monoclonal antibodies, including 
non-human and human antibodies, humanized antibodies, chimeric antibodies and 
antigen-binding fragments thereof (Current Protocols in Immunology, John Wiley & 
5 Sons, N.Y. (1994); EP Application 173,494 (Morrison); International Patent 

Application WO86/01533 (Neuberger); and U.S. Patent No. 5,225,539 (Winters)) which 
bind to the described protein or polypeptide are within the scope of the invention. A 
mammal, such as a mouse, rat, hamster, goat or rabbit, can be immunized with an 
immunogenic form of the protein (e.g., the full length protein or a polypeptide 

10 comprising an antigenic fragment of the protein which is capable of eliciting an 
antibody response). Techniques for conferring immunogenicity on a protein or 
polypeptide include conjugation to carriers or other techniques well known in the art. 
The protein or polypeptide can be administered in the presence of an adjuvant. The 
progress of immunization can be monitored by detection of antibody titers in plasma or 

1 5 serum. Standard ELISA or other immunoassays can be used with the immunogen as 
antigen to assess the levels of antibody. 

Following immunization, anti-peptide antisera can be obtained, and if desired, 
polyclonal antibodies can be isolated from the serum. Monoclonal antibodies can also 
be produced by standard techniques which are well known in the art (Kohler and 

20 Milstein, Nature 255:495-497 (1975); Kozbar et al, Immunology Today 4:12 (1983); 
and Cole et al, Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77- 
96 (1985)). The term "antibody" as used herein is intended to include fragments 
thereof, such as Fab and F(ab) 2 . Antibodies described herein can be used to inhibit the 
activity of the polypeptides and proteins described herein, particularly in vitro and in 

25 cell extracts, using methods known in the art. Additionally, such antibodies, in 
conjunction with a label, such as a radioactive label, can be used to assay for the 
presence of the expressed protein in a cell from, e.g., a tissue sample, and can be used in 
an immunoabsorption process, such as an ELISA, to isolate the protein or polypeptide. 
Tissue samples which can be assayed include mammalian tissues, e.g., differentiated 



1062.2002-004 



-28- 

and non-differentiated cells. Examples include bone marrow, thymus, kidney, liver, 
brain, pancreas, fibroblasts and epithelium. These antibodies are useful in diagnostic 
assays, or as an active ingredient in a pharmaceutical composition. 

The present invention also pertains to pharmaceutical compositions comprising 
5 polypeptides and other compounds described herein. For instance, a polypeptide or 
protein, or prodrug thereof, of the present invention can be formulated with a 
physiologically acceptable medium to prepare a pharmaceutical composition. The 
particular physiological medium may include, but is not limited to, water, buffered 
saline, polyols (e.g., glycerol, propylene glycol, liquid polyethylene glycol) and dextrose 

10 solutions. The optimum concentration of the active ingredient(s) in the chosen medium 
can be determined empirically, according to well known procedures, and will depend on 
the ultimate pharmaceutical formulation desired. Methods of introduction of exogenous 
polypeptides at the site of treatment include, but are not limited to, intradermal, 
intramuscular, intraperitoneal, intravenous, subcutaneous, oral, rectal and intranasal. 

1 5 Other suitable methods of introduction can also include gene therapy, rechargeable or 
biodegradable devices and slow release polymeric devices. The pharmaceutical 
compositions of this invention can also be administered as part of a combinatorial 
therapy with other agents. 



Screening Assays 

20 The invention provides a method (also referred to herein as a "screening assay") 

for identifying modulators, i.e., candidate or test compounds or agents (e.g., antisense, 
polypeptides, peptidomimetics, small molecules or other drugs) which bind to nucleic 
acid molecules, polypeptides or proteins described herein or have a stimulatory or 
inhibitory effect on, for example, expression or activity of the nucleic acid molecules, 

25 polypeptides or proteins of the invention. 

In one embodiment, the invention provides assays for screening candidate or test 
compounds which bind to or modulate the activity of protein or polypeptide described 
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herein or biologically active portion thereof The test compounds of the present 
invention can be obtained using any of the numerous approaches in combinatorial 
library methods known in the art, including: biological libraries; spatially addressable 
parallel solid phase or solution phase libraries; synthetic library methods requiring 
5 deconvolution; the 'one-bead one-compound' library method; and synthetic library 
methods using affinity chromatography selection. The biological library approach is 
limited to polypeptide libraries, while the other four approaches are applicable to 
polypeptide, non-peptide oligomer or small molecule libraries of compounds (Lam, K.S. 
(1997) Anticancer Drug Des. 12: 145). 

10 Examples of methods for the synthesis of molecular libraries can be found in the 

art, for example in DeWitt et al (1993) Proc. Natl Acad Sci. U.S.A., 90:6909; Erb et 
al (1994) Proc. Natl. Acad. Sci. U.S.A., 97:11422; Zuckermann et al (1994). J. Med. 
Chem., 37:2678; Cho et a/.(1993) Science, 261:1303; Carell et al (1994) Angew. Chem. 
Int. Ed Engl, 33:2059; Carell et al (1994) Angew. Chem. Int. Ed Engl, 33:2061; and 

15 in Gallop et al (1994) J. Med Chem., 37:1233. 

Libraries of compounds may be presented in solution (e.g., Houghten (1992) 
Biotechniques, 73:412-421), or on beads (Lam (1991) Nature, 354:82-84), chips (Fodor 
(1993) Nature, 3tf4;555-556), bacteria (Ladner USP 5,223,409), spores (Ladner USP 
'409), plasmids (Cull et al (1992) Proc. Natl Acad. Sci. U.S.A., 59:1865-1869) or on 

20 phage (Scott and Smith (1990) Science, 249:386-390); (Devlin (1990) Science, 249:404- 
406); (Cwirla et al (1990) Proc. Natl Acad Sci. U.S.A., 97:6378-6382); (Felici (1991) 
J. Mol Biol, 222:301-310); (Ladner supra). 

In one embodiment, an assay is a cell-based assay in which a cell which 
expresses an encoded protein which is contacted with a test compound and the ability of 

25 the test compound to bind to the protein is determined. The cell, for example, can be of 
mammalian origin, such as a thymocyte. Determining the ability of the test compound 
to bind to the polypeptide can be accomplished, for example, by coupling the test 
compound with a radioisotope, enzymatic label or fluorescent label, such that binding of 
the test compound to the polypeptide can be determined by detecting the labeled with 
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125 I, 35 S, 14 C, or 3 H, either directly or indirectly, and the radioisotope detected by direct 
counting of radioemmission or by scintillation counting. Alternatively, test compounds 
can be enzymatically labeled with, for example, horseradish peroxidase, alkaline 
phosphatase, or luciferase, and the enzymatic label detected by determination of 
5 conversion of an appropriate substrate to product. 

It is also within the scope of this invention to determine the ability of a test 
compound to interact with the polypeptide without the labeling of any of the 
interactants. For example, a microphysiometer can be used to detect the interaction of a 
test compound with the polypeptide without the labeling of either the test compound or 

10 the polypeptide. McConnell, H.M. et al (1992) Science, 257:1906-1912. As used 
herein, a "microphysiometer" (e.g., Cytosensor™) is an analytical instrument that 
measures the rate at which a cell acidifies its environment using a light-addressable 
potentiometric sensor (LAPS). Changes in this acidification rate can be used as an 
indicator of the interaction between ligand and polypeptide. 

1 5 In another embodiment, an assay is a cell-based assay comprising contacting a 

cell expressing a particular target molecule described herein with a test compound and 
determining the ability of the test compound to modulate or alter (e.g. stimulate or 
inhibit) the activity of the target molecule. Determining the ability of the test compound 
to modulate the activity of the target molecule can be accomplished, for example, by 

20 determining the ability of the polypeptide to inhibit NFicB-induced gene expression. 

In a preferred embodiment, determining the ability of the polypeptide to inhibit 
NFicB-induced expression can be accomplished by determining the activity of the target 
molecule. For example, detecting the induction of a reporter gene (comprising a target- 
responsive regulatory element operatively linked to a nucleic acid encoding a detectable 

25 marker, e.g., luciferase) or detecting a cellular response, for example, apoptosis, 
differentiation or rate of proliferation. 

In yet another embodiment, an assay of the present invention is a cell-free assay 
in which protein of the invention or biologically active portion thereof is contacted with 
a test compound and the ability of the test compound to bind to the protein or 
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biologically active portion thereof is determined. Binding of the test compound to the 
protein can be determined either directly or indirectly as described above. In one 
embodiment, the assay includes contacting the protein or biologically active portion 
thereof with a known compound which binds the protein to form an assay mixture, 
5 contacting the assay mixture with a test compound, and determining the ability of the 
test compound to interact with the protein. Determining the ability of the test compound 
to interact with the protein comprises determining the ability of the test compound to 
preferentially bind to the protein or biologically active portion thereof as compared to 
the known compound. 

10 In another embodiment, the assay is a cell-free assay in which a protein of the 

invention or biologically active portion thereof is contacted with a test compound and 
the ability of the test compound to modulate or alter (e.g., stimulate or inhibit) the 
activity of the protein or biologically active portion thereof is determined. Determining 
the ability of the test compound to modulate the activity of the protein can be 

15 accomplished, for example, by determining the ability of the protein to bind to a known 
target molecule by one of the methods described above for determining direct binding. 
Determining the ability of the protein to bind to a target molecule can also be 
accomplished using technology such as real-time Bimolecular Interaction Analysis 
(BIA). Sjolander, S. and Urbaniczky, C. (1991) Anal Chem., 55:2338-2345 and Szabo 

20 et al (1995) Curr. Opin. Struct. Biol, 5:699-705. As used herein, "BIA" is a 

technology for studying biospecific interactions in real time, without labeling any of the 
interactants (e.g., BIAcore™). Changes in the optical phenomenon surface plasmon 
resonance (SPR) can be used as an indication of real-time reactions between biological 
molecules. 

25 In an alternative embodiment, determining the ability of the test compound to 

modulate the activity of a protein of the invention can be accomplished by determining 
the ability of the protein to further modulate the activity of a target molecule. For 
example, the activity of NFicB-induced gene expression as previously described. 
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In yet another embodiment, the cell-free assay involves contacting a protein of 
the invention or biologically active portion thereof with a known compound which 
binds the protein to form an assay mixture, contacting the assay mixture with a test 
compound, and determining the ability of the test compound to interact with the protein, 
5 wherein determining the ability of the test compound to interact with the protein 

comprises determining the ability of the protein to preferentially bind to or modulate the 
activity of a target molecule. 

The cell-free assays of the present invention are amenable to use of both soluble 
and/or membrane-bound forms of isolated proteins. In the case of cell-free assays in 

10 which a membrane-bound form an isolated protein is used it may be desirable to utilize 
a solubilizing agent such that the membrane-bound form of the isolated protein is 
maintained in solution. Examples of such solubilizing agents include non-ionic 
detergents such as n-octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl- 
N-methylglucamide, decanoyl-N-methylglucamide, Triton®X-100, Triton® X-l 14, 

15 Thesit®, Isotridecypoly (ethylene glycol ether) n , 3-[(3-cholamidopropyl) 
dimethylamminio]-l -propane sulfonate (CHAPS), 3-[(3-cholamidopropyl) 
dimethylamminio]-2-hydroxy-l -propane sulfonate (CHAPSO), or N-dodecyl-N, N- 
dimethyl-3-ammonio- 1 -propane sulfonate. 

In more than one embodiment of the above assay methods of the present 

20 invention, it may be desirable to immobilize either the protein or its target molecule to 
facilitate separation of complexed from uncomplexed forms of one or both of the 
proteins, as well as to accommodate automation of the assay. Binding of a test 
compound to the protein, or interaction of the protein with a target molecule in the 
presence and absence of a candidate compound, can be accomplished in any vessel 

25 suitable for containing the reactants. Examples of such vessels include microtitre plates, 
test tubes, and micro-centrifuge tubes. In one embodiment, a fusion protein can be 
provided which adds a domain that allows one or both of the proteins to be bound to a 
matrix. For example, glutathione- S -transferase fusion proteins can be adsorbed onto 
glutathione sepharose beads (Sigma Chemical, St. Louis, MO) or glutathione 
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derivatized microtitre plates, which are then combined with the test compound or the 
test compound and either the non-adsorbed target protein or protein of the invention, 
and the mixture incubated under conditions conducive to complex formation (e.g., at 
physiological conditions for salt and pH). Following incubation, the beads or microtitre 
5 plate wells are washed to remove any unbound components, the matrix immobilized in 
the case of beads, complex determined either directly or indirectly, for example, as 
described above. Alternatively, the complexes can be dissociated from the matrix, and 
the level of binding or activity determined using standard techniques. 

Other techniques for immobilizing proteins on matrices can also be used in the 

10 screening assays of the invention. For example, either a protein of the invention or a 
target molecule can be immobilized utilizing conjugation of biotin and streptavidin. 
Biotinylated protein of the invention or target molecules can be prepared from biotin- 
NHS(N-hydroxy-succinimide) using techniques well known in the art (e.g., 
biotinylation kit, Pierce Chemicals, Rockford, IL), and immobilized in the wells of 

15 streptavidin-coated 96 well plates (Pierce Chemical). Alternatively, antibodies reactive 
with a protein of the invention or target molecules, but which do not interfere with 
binding of the protein to its target molecule, can be derivatized to the wells of the plate, 
and unbound target or protein trapped in the wells by antibody conjugation. Methods 
for detecting such complexes, in addition to those described above for the GST- 

20 immobilized complexes, include immunodetection of complexes using antibodies 

reactive with the protein or target molecule, as well as enzyme-linked assays which rely 
on detecting an enzymatic activity associated with the protein or target molecule. 

In another embodiment, modulators of expression of nucleic acid molecules of 
the invention are identified in a method wherein a cell is contacted with a candidate 

25 compound and the expression of appropriate mRNA or protein in the cell is determined. 
The level of expression of appropriate mRNA or protein in the presence of the candidate 
compound is compared to the level of expression of mRNA or protein in the absence of 
the candidate compound. The candidate compound can then be identified as a 
modulator of expression based on this comparison. For example, when expression of 
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mRNA or protein is greater (statistically significantly greater) in the presence of the 
candidate compound than in its absence, the candidate compound is identified as a 
stimulator or enhancer of the mRNA or protein expression. Alternatively, when 
expression of the mRNA or protein is less (statistically significantly less) in the 
5 presence of the candidate compound than in its absence, the candidate compound is 
identified as an inhibitor of the mRNA or protein expression. The level of mRNA or 
protein expression in the cells can be determined by methods described herein for 
detecting mRNA or protein. 

In yet another aspect of the invention, the proteins of the invention can be used 

10 as "bait proteins" in a two-hybrid assay or three-hybrid assay (see, e.g., U.S. Patent No. 
5,283,317; Zervos et al (1993) Cell 72:223-232; Madura et al (1993) J. Biol Chem., 
2(5(9:12046-12054; Bartel et al (1993) Biotechniques, 74:920-924; Iwabuchi et al 
(1993) Oncogene, 5:1693-1696; and Brent WO94/10300), to identify other proteins 
(captured proteins) which bind to or interact with the proteins of the invention and 

1 5 modulate their activity. Such captured proteins are also likely to be involved in the 
propagation of signals by the proteins of the invention as, for example, downstream 
elements of a protein-mediated signaling pathway. 

The two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. Briefly, the 

20 assay utilizes two different DNA constructs. In one construct, the gene that codes for a 
protein of the invention is fused to a gene encoding the DNA binding domain of a 
known transcription factor (e.g., GAL-4). In the other construct, a DNA sequence, from 
a library of DNA sequences, that encodes an unidentified protein ("prey" or "sample") is 
fused to a gene that codes for the activation domain of the known transcription factor. If 

25 the "bait" and the "prey" proteins are able to interact, in vivo, forming an protein- 
dependent complex, the DNA-binding and activation domains of the transcription factor 
are brought into close proximity. This proximity allows transcription of a reporter gene 
(e.g., LacZ) which is operably linked to a transcriptional regulatory site responsive to 
the transcription factor. Expression of the reporter gene can be detected, and cell 
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colonies containing the functional transcription factor can be isolated and used to obtain 
the cloned gene which encodes the protein which interacts with the protein of the 
invention. 

This invention further pertains to novel agents identified by the above-described 
5 screening assays. Accordingly, it is within the scope of this invention to further use an 
agent identified as described herein in an appropriate animal model. For example, an 
agent identified as described herein (e.g., a modulating agent, an antisense nucleic acid 
molecule, a specific antibody, or a protein-binding partner) can be used in an animal 
model to determine the efficacy, toxicity, or side effects of treatment with such an agent. 

10 Alternatively, an agent identified as described herein can be used in an animal model to 
determine the mechanism of action of such an agent. In addition, this invention 
encompasses the use of transgenic animals that express a nucleic acid sequence selected 
from the group consisting of SEQ ID NOS:l and 3, the complement of SEQ ID NOS:l 
and 3, the nucleic acid sequence that encode SEQ ID NOS: 2 and 4. 

15 Furthermore transgenic animals that have SEQ ID NO: 3 or fragment thereof 

deleted are also encompassed within this invention. These transgenic animals can be 
used in multiple methods to assay, for example, modulators of expression of the nucleic 
acid molecules of the invention identified in a method wherein the transgenic animal is 
exposed to a candidate compound and the expression of appropriate mRNA or protein 

20 in the cell is determined. The level of expression of appropriate mRNA or protein in the 
presence of the candidate compound is compared to the level of expression of mRNA or 
protein in the absence of the candidate compound. The candidate compound can then 
be identified as a modulator of expression based on this comparison. For example, 
when expression of mRNA or protein is greater (statistically significantly greater) in the 

25 presence of the candidate compound than in its absence, the candidate compound is 
identified as a stimulator or enhancer of the mRNA or protein expression. 
Alternatively, when expression of the mRNA or protein is less (statistically significantly 
less) in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an inhibitor of the mRNA or protein expression. The level of 
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mRNA or protein expression in the cells can be determined by methods described herein 
for detecting mRNA or protein and as is well known in the art. 

Furthermore, this invention pertains to uses of novel agents identified by the 
above-described screening assays for treatments as described herein. 

5 Methods of Treatment 

The present invention provides for both prophylactic and therapeutic methods of 
treating a subject at risk of (or susceptible to) a disorder or having a disorder associated 
with aberrant expression or activity of proteins or nucleic acids of the invention. 

Disorders for treatment include, but not limited to, autoimmune diseases such as 

10 arthritis (e.g., rheumatoid arthritis, juvenile rheumatoid arthritis, psoriatic arthritis), 
multiple sclerosis, systemic lupus erythematosus, myasthenia gravis, juvenile onset 
diabetes, diabetes mellitus, nephritides such as glomerulonephritis, autoimmune 
thyroiditis, Behcet's disease; inflammatory or allergic diseases and conditions, including 
respiratory allergic diseases such as asthma, allergic rhinitis, hypersensitivity lung 

15 diseases, hypersensitivity pneumonitis, interstitial lung diseases (ILD) (e.g., idiopathic 
pulmonary fibrosis, or ILD associated with rheumatoid arthritis, systemic lupus 
erythematosus, ankylosing spondylitis, systemic sclerosis, Sjogren's syndrome, 
polymyositis or dermatomyositis); chronic obstructive pulmonary disease; anaphylaxis 
or hypersensitivity responses, drug allergies (e.g., to penicillin, cephalosporins), insect 

20 sting allergies; inflammatory bowel diseases, such as Crohn's disease and ulcerative 
colitis; spondyloarthropathies; scleroderma; psoriasis and inflammatory dermatoses 
such as dermatitis, eczema, atopic dermatitis, allergic contact dermatitis, urticaria; 
vasculitis (e.g., necrotizing, cutaneous, and hypersensitivity vasculitis), graft rejection 
(e.g., in transplantation), including allograft rejection or graft-versus-host disease, and 

25 organ transplant-associated arteriosclerosis; cancer, tuberculosis, malaria and HIV 
infection. 

With regard to both prophylactic and therapeutic methods of treatment, such 
treatments may be specifically tailored or modified, based on knowledge obtained from 
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the field of pharmacogenomics. "Pharmacogenomics", as used herein, refers to the 
application of genomics technologies such as gene sequencing, statistical genetics, and 
gene expression analysis to drugs in clinical development and on the market. More 
specifically, the term refers the study of how a patients genes determine his or her 
5 response to a drug (e.g., a patient's "drug response phenotype", or "drug response 
genotype".) Thus, another aspect of the invention provides methods for tailoring an 
individual's prophylactic or therapeutic treatment with the molecules of the present 
invention or modulators according to that individual's drug response genotype. 
Pharmacogenomics allows a clinician or physician to target prophylactic or therapeutic 
10 treatments to patients who will most benefit from the treatment and to avoid treatment 
of patients who will experience toxic drug related side effects. 

1. Prophylactic Methods 

In one aspect, the invention provides a method for preventing in a subject, a 
disease or condition associated with aberrant expression or activity of genes or proteins 

1 5 of the present invention, by administering to the subject an agent which modulates 

expression or at least one activity of a gene or protein of the invention. Subjects at risk 
for a disease which is caused or contributed to by aberrant gene expression or protein 
activity can be identified by, for example, any or a combination of diagnostic or 
prognostic assays as described herein. Administration of a prophylactic agent can occur 

20 prior to the manifestation of symptoms characteristic of the aberrancy, such that a 

disease or disorder is prevented or, alternatively, delayed in its progression. Depending 
on the type of aberrancy, for example, an agonist or antagonist agent can be used for 
treating the subject. The appropriate agent can be determined based on screening assays 
described herein. The prophylactic methods of the present invention are further 

25 discussed in the following subsections. 
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2. Therapeutic Methods 

Another aspect of the invention pertains to methods of modulating expression or 
activity of genes or proteins of the invention for therapeutic purposes. The modulatory 
method of the invention involves contacting a cell with an agent that modulates one or 
5 more of the activities of the specified protein associated with the cell. An agent that 
modulates protein activity can be an agent as described herein, such as a nucleic acid or 
a protein, a naturally-occurring target molecule of a protein described herein, a 
polypeptide, a peptidomimetic, or other small molecule. In one embodiment, the agent 
stimulates one or more protein activities. Examples of such stimulatory agents include 

10 active protein as well as a nucleic acid molecule encoding the protein that has been 
introduced into the cell. In another embodiment, the agent inhibits one or more protein 
activities. Examples of such inhibitory agents include antisense nucleic acid molecules 
and anti-protein antibodies. These modulatory methods can be performed in vitro (e.g., 
by culturing the cell with the agent) or, alternatively, in vivo (e.g., by administering the 

15 agent to a subject). As such, the present invention provides methods of treating an 
individual afflicted with a disease or disorder characterized by aberrant expression or 
activity of a protein or nucleic acid molecule of the invention. In one embodiment, the 
method involves administering an agent (e.g., an agent identified by a screening assay 
described herein), or combination of agents that modulates (e.g., upregulates or 

20 downregulates) expression or activity of a gene or protein of the invention. In another 
embodiment, the method involves administering a protein or nucleic acid molecule of 
the invention as therapy to compensate for reduced or aberrant expression or activity of 
the protein or nucleic acid molecule. 

Stimulation of protein activity is desirable in situations in which the protein is 

25 abnormally downregulated and/or in which increased protein activity is likely to have a 
beneficial effect. Likewise, inhibition of protein activity is desirable in situations in 
which the protein is abnormally upregulated and/or in which decreased protein activity 
is likely to have a beneficial effect. One example of such a situation is where a subject 
has a disorder characterized by aberrant development or cellular differentiation. 
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Another example of such a situation is where the subject has a proliferative disease 
(e.g., cancer) or a disorder characterized by an aberrant hematopoietic response. 

3. Pharmacogenomics 

The molecules of the present invention, as well as agents, or modulators which 
5 have a stimulatory or inhibitory effect on the protein activity (e.g., gene expression) as 
identified by a screening assay described herein can be administered to individuals to 
treat (prophylactically or therapeutically) disorders (e.g., proliferative or developmental 
disorders) associated with aberrant protein activity. In conjunction with such treatment, 
pharmacogenomics (i.e., the study of the relationship between an individual's genotype 

10 and that individual's response to a foreign compound or drug) may be considered. 
Differences in metabolism of therapeutics can lead to severe toxicity or therapeutic 
failure by altering the relation between dose and blood concentration of the 
pharmacologically active drug. Thus, a physician or clinician may consider applying 
knowledge obtained in relevant pharmacogenomics studies in determining whether to 

15 administer a molecule of the invention or modulator thereof, as well as tailoring the 
dosage and/or therapeutic regimen of treatment with such a molecule or modulator. 

Pharmacogenomics deals with clinically significant hereditary variations in the 
response to drugs due to altered drug disposition and abnormal action in affected 
persons. See e.g., Eichelbaum, M, Clin Exp Pharmacol Physiol, (1996) 23(10- 

20 77J:983-985 and Linder, M.W., Clin. Chem. (1997) 43(2)254-266. In general, two 
types of pharmacogenetic conditions can be differentiated. Genetic conditions 
transmitted as a single factor altering the way drugs act on the body (altered drug action) 
or genetic conditions transmitted as single factors altering the way the body acts on 
drugs (altered drug metabolism). These pharmacogenetic conditions can occur either as 

25 rare genetic defects or as naturally-occurring polymorphisms. For example, glucoses- 
phosphate dehydrogenase deficiency (G6PD) is a common inherited enzymopathy in 
which the main clinical complication is haemolysis after ingestion of oxidant drugs 
(anti-malarials, sulfonamides, analgesics, nitrofurans) and consumption of fava beans. 
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One pharmacogenetics approach to identifying genes that predict drug 
response, known as "a genome-wide association", relies primarily on a high-resolution 
map of the human genome consisting of already known gene-related markers (e.g., a 
"bi-allelic" gene marker map which consists of 60,000-100,000 polymorphic or variable 
5 sites on the human genome, each of which has two variants). Such a high-resolution 
genetic map can be compared to a map of the genome of each of a statistically 
significant number of patients taking part in a Phase II/III drug trial to identify markers 
associated with a particular observed drug response or side effect. Alternatively, such a 
high resolution map can be generated from a combination of some ten-million known 

10 single nucleotide polymorphisms (SNPs) in the human genome. As used herein, a 
"SNP" is a common alteration that occurs in a single nucleotide base in a stretch of 
DNA. For example, a SNP may occur once per every 1,000 bases of DNA. A SNP may 
be involved in a disease process, however, the vast majority may not be disease- 
associated. Given a genetic map based on the occurrence of such SNPs, individuals can 

1 5 be grouped into genetic categories depending on a particular pattern of SNPs in their 
individual genome. In such a manner, treatment regimens can be tailored to groups of 
genetically similar individuals, taking into account traits that may be common among 
such genetically similar individuals. 

Alternatively, a method termed the "candidate gene approach", can be utilized to 

20 identify genes that predict drug response. According to this method, if a gene that 

encodes a drug's target is known (e.g., a protein or a receptor of the present invention), 
all common variants of that gene can be fairly easily identified in the population and it 
can be determined if having one version of the gene versus another is associated with a 
particular drug response. 

25 As an illustrative embodiment, the activity of drug metabolizing enzymes is a 

major determinant of both the intensity and duration of drug action. The discovery of 
genetic polymorphisms of drug metabolizing enzymes (e.g., N-acetyltransferase 2(NAT 
2) and cytochrome P450 enzymes CYP2D6 and CYP2C19) has provided an explanation 
as to why some patients do not obtain the expected drug effects or show exaggerated 
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drug response and serious toxicity afer taking the standard and safe dose of a drug. 
These polymorphisms are expressed in two phenotypes in the population, the extensive 
metabolizer (EM) and poor metabolizer (PM). The prevalence of PM is different 
among different populations. For example, the gene coding for CYP2D6 is highly 
5 polymorphic and several mutations have been identified in PM, which all lead to the 
absence of functional CYP2D6. Poor metabolizers of CYP2D6 and CYP2C19 quite 
frequently experience exaggerated drug response and side effects when they receive 
standard doses. If a metabolite is the active therapeutic moiety, PM show no therapeutic 
response, as demonstrated for the analgesic effect of codeine mediated by its CYP2D6- 

10 formed metabolite morphine. The other extreme are the so called ultra-rapid 

metabolizers who do not respond to standard doses. Recently, the molecular basis of 
ultra-rapid metabolism has been identified to be due to CYP2D6 gene amplification. 

Alternatively, a method termed the "gene expression profiling", can be utilized 
to identify genes that predict drug response. For example, the gene expression of an 

15 animal dosed with a drug (e.g., a molecule or modulator of the present invention) can 
give an indication whether gene pathways related to toxicity have been turned on. 

Information generated from more than one of the above pharmacogenomics 
approaches can be used to determine appropriate dosage and treatment regimens for 
prophylactic or therapeutic treatment of an individual. This knowledge, when applied to 

20 dosing or drug selection, can avoid adverse reactions or therapeutic failure and thus 
enhance therapeutic or prophylactic efficiency when treating a subject with a molecule 
or modulator of the invention, such as a modulator identified by one of the exemplary 
screening assays described herein. 

EXAMPLES 
25 Materials and Methods 
Animals 

N15tg RAG-2"'- TCR H-2 b and N15tg RAG-2" A TCR H-2 d mice were generated 
as described (Ghendler et al, 1998). C57BL/6 and TCRCyt5CC7-I-RAG-2 A mice were 
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obtained from Taconic. Mice were maintained and bred under sterile barrier conditions 
at the animal facility of the Dana-Farber Cancer Institute. 

Representational Difference Analysis (RDA) 

Thymii were dissected from 3 week old N15tg RAG-2"'" TCR H-2 b mice 1 h after 
5 i.v. injection of 24 |ig VSV8 peptide or from untreated 3 week old N15tg RAG-2" 7 " TCR 
H-2 d mice. The thymii were dissociated in PBS-1% BSA thymocytes washed in PBS- 
2% FCS and stained with anti-CD4 mAb (BD Pharmingen). In these class I-restricted 
transgenic animals, all CD4+ cells are DP. DP thymocytes were sorted on a MoFlo 
(Cytomation) into ice cold PBS-2%FCS and RNA prepared using guanidium 

10 isothiocyanate as described (Lerner et al, 1996). The RNA was polyA-selected using 
the polyA Spin mRNA Isolation Kit (New England Biolabs). This mRNA was then used 
for RDA (Lisitsyn et ai, 1993; Hubank and Schatz, 1994). Due to limitations in the 
amount of mRNA, a subtracted library in the pZErO-1 vector (Invitrogen) using 
difference product 2 was prepared instead of difference product 3. For this purpose, 

15 difference product 2 was size fractionated on an agarose gel; 6 gel slices containing 
DNAs of different sized-fragments were eluted and ligated to pZErO-1. DNA from one 
gel slice containing an obvious PCR band was ligated, transformed and screened with a 
full length nur77 probe. DNA was prepared from nur77 negative clones for sequencing 
and use as probes in Northern blot analysis. 

20 Constructs 

Full length IkBNS was obtained by screening a cDNA library in AZAP Express 
(Stratagene) prepared from the thymii of C57BL/6 mice sacrificed 2, 4 and 6 h after 
injection of 200 |ig anti-CD3e mAb. After in vivo excision, the cDNA was in the 
pBKCMV vector. For detection of protein upon transfection into Cos-7 cells, a Flag tag 
25 was added to IkBNS by PCR and the product subcloned into pCR2.1 using the TA 
Cloning Kit (Invitrogen) and then into pcDNA3.1 (Invitrogen) with Kpnl and EcoRV. 
To produce a fusion protein with GST, a BamHI/XhoI fragment from Flag-tagged 
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pCR2.1 IkBNS was ligated to BamHI/XhoI-cut pGEX-4T-l (Amersham Pharmacia 
Biotech). GST-IkBNS was purified following the procedures recommended by 
Amersham Pharmacia Biotech. The superinhibitor IkBcc was used to transfect Cos-7 
cells and to produce a GST fusion protein. For this purpose, IkBoc from CD2 MAD 
5 (Hettmann et ai, 1 999) was PCR'd and subcloned into pCR2. 1 using the TA Cloning 
Kit (Invitrogen) and the EcoRI/Sall fragment subcloned into EcoRI/XhoI cut 
pcDNA3.1. For GST-IkBcc, the EcoRI/Sall fragment of pCR2.1-IicBa was ligated to 
EcoRI/Sall-cut pGEX-4T-l. GST-lKBa protein was purified as described above. 

FTOC 

1 0 For FTOC, pregnant Nl 5tg RAG-2' 7 " TCR H-2 b mice were sacrificed and fetal 

thymii removed at day 15.5 with the observation of vaginal plug as day 1 . Fetal thymii 
were cultured in AIM-V media (Invitrogen) in Transwell dishes (Costar) in a humidified 
atmosphere with 5% C02 for 5 days. On day 5, peptides were added to 10 \iM. 
Thymocytes were harvested 2 h after peptide addition and RNA prepared using the 

1 5 Qiagen RNeasy Mini kit according to the manufacturer's protocol (Qiagen). 

Antisense oligonucleotide blockade 

FTOC were set up as described. On day 4, phosphorothioate oligonucleotides 
(sense, 5'CCCCTGGTGATGGAGGACTCT3\ or antisense, 

5'AGAGTCCTCCATCACCAGGGG3' from MWG Biotech, Inc.) were added at 200 
20 |ig/ml. After 12-19 h, VSV8 peptide was added to some FTOC at 300 |iM. After 4 more 
hours, thymic lobes were harvested and analyzed by FACS. Thymocytes were stained at 
~5 x 10 6 cells per ml in PBS-2% FCS-0.05% NaN3 containing the antibodies at 
saturating concentrations. The antibodies were anti-CD8oc-FITC (53-6.7) and anti-CD4- 
PE (RM4.5) from Pharmingen. The phenotypes and proportions of thymocyte subsets 
25 were analyzed by two-color flow cytometry using a FACScan (Becton Dickinson) and 
the CellQuest program. Dead cells were excluded by gating. 
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Northem analysis 

Total thymic RNA was prepared from mice or cultures treated as indicated in 
Figures 3B and 3C using guanidium isothiocyanate (Lerner et aL, 1996). FTOC RNA 
was prepared as described above. RNA was eletrophoresed on 1% 
5 agarose/formaldehyde gels and transferred to Biotrans nylon membranes (ICN). Blots 
were hybridized to 1 x 10 6 cpm/ml probe at 65°C in 7% SDS/0.5M NaP04 pH 
7.2/1 %BS A, washed and autoradiographed. Probes were labeled using the Random 
Prime Labeling Kit (Roche Biochemicals). 

Peptides 

10 SEV9 5 PCC, VSV8 and VSV8 variant peptides were synthesized by standard 

solid phase methods on an Applied Biosystems 430A synthesizer (Sasada et aL, 2000). 
For in vivo injections, 24 |ig peptide in PBS was injected intravenously into mice and 
the mice sacrificed at the indicated times. 

Transfections and luciferase assays 

15 Cos-7 cells were transfected using the calcium phosphate method (Turner et al t 

1990). Cos-7 cells were plated at 0.5 X 10 6 cells/well in 6 well dishes and transfected 
the following day with 0.2 \ig pRL-null (Promega) plus 5 |ag (kB)3 luciferase plasmid 
(Plaisance et aL, 1997). In addition, 10 (ig of pcDNA3.1 or pcDNA3.1-IicBNS or 
pcDNA3.1-lKBcx were co-transfected with the two luciferase plasmids. 24 h after 

20 transfection cells were left untreated or activated with 50 ng/ml PMA for 7 h. Cell 

lysates were then prepared using the Dual-Luciferase Reporter Assay System (Promega) 
and luciferase activity measured on a Monolight 2010 luminometer (Analytical 
Luminescence Laboratory). The pRL-null vector served as an internal control for 
transfection (Behre et aL, 1999). 
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Electrophoretic mobility shift assays 

Nuclear and cytoplasmic extracts were prepared as previously described 
(Schreiber et al y 1989) from the thymus of C57BL/6 mice or N15tg RAG-2" 7 " TCR H-2 b 
mice treated as described in the text. Protein concentrations were determined by 
5 Coomassie Protein Assay Reagent. NF-kB (MWG-Biotech, Inc.) and AP-1 (Santa Cruz 
Biotechnology) double stranded probes were kinased and used to determine binding 
activity. 4 \xg nuclear extracts were incubated for 30 min at room temperature with 1 x 
10 4 counts of probe in 20 \xl binding buffer (20mM Tris pH 7.5, 0.1M NaCl, 2mM DTT, 
ImM EDTA, lmg/ml BSA, 0.1% NP-40, 4% glycerol) containing 1 |ig poly(dldC) 
10 (Amersham Pharmacia Biotech). DNA-protein complexes were resolved by 

electrophoresis on 5% (30:1.2; acrylamide:bis) polyacrylamide gels in 0.5X TBE. The 
gels were then dried and autoradiographed. Where indicated, 1 \ig GST, GST- IkBNS, 
or GST-IkBcc were added to the thymic lysates and incubated 30 min on ice prior to 
addition of the probe. 

15 Protein interaction assays 

To examine the association of IkBNS with NF-kB family members, murine NF- 
kB family members p50 and p65, separately or together, were in vitro translated in the 
presence of [35 S] cysteine/methionine (NEN Life Science Products) using TNT-coupled 
reticulocyte lysate systems (Promega). In v/Yro-translated proteins were incubated with 

20 5 fig of GST fusion proteins precoupled to Glutathione Sepharose 4B beads (Amersham 
Pharmacia Biotech) for 2 h at 4°C in 0.5 ml incubation buffer (PBS/0.1% Triton X- 
100/1 mM PMSF). After washing, associated proteins were analyzed by 10% SDS- 
PAGE followed by autoradiography. For analysis of IkBNS -interacting NF-kB proteins 
in the thymus, mice were injected i.v. with VSV8 peptide or left untreated and 

25 subsequently sacrificed at various time points. Single cell suspensions of thymocytes 
were prepared as described above (RDA). For cytosolic extracts, thymocytes were 
resuspended in 250 \il of buffer A (10 mM Hepes pH 7.9/1.5 mM MgC12/10 mM 
KC1/0.5 mM DTT/0.2 mM PMSF), incubated for 10 min on ice and centrifuged (8000 
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rpm, 8 min, 4°C). Supernatants were collected as cytosolic extracts while pellets were 
resuspended in 250 |il of buffer C (20 mM Hepes pH 7.9/25% Glycerol/420mM 
NaCl/1,5 mM MgC12/ 0.2 mM EDTA/0.5 mM DTT/0.2 mM PMSF/ 10 [ig/ml each of 
aprotinin and leupeptin). After incubation for 20 min on ice, extracts were centrifuged 
5 (14000 rpm, 15 min, 4°C) and supernatants were collected as nuclear extracts. Protein 
concentrations were determined using the Micro BCA Protein Assay Reagent Kit. 
Extracts were adjusted to 20 mM Hepes pH 7.9/150 mM NaCl/0.1% Triton X-100/1 
mM PMSF (final volume 750 and precleared with Glutathione Sepharose 4B beads 
for 1 h at 4°C. Subsequently, extracts were incubated with GST-, GST-IkBNS-, and 
10 GST-IkBoc- beads for 1 h at 4°C. Finally, beads were washed with 20 mM Hepes pH 
7.9/150 mM NaCl/0.1% Triton X-100/1 mM PMSF, boiled in standard reducing sample 
buffer and analyzed by Western blot with the indicated antibodies (p52, p65, c-Rel and 
RelB antibodies from Santa Cruz; p50 antibody from Stressgen). 

Example 1 

15 cDNA cloning ofltcBNS 

Homozygous N15 TCRtg RAG-2 7 " H-2 b mice bearing a TCR specific for the 
VSV8 peptide bound to the H-2 K b MHC class I molecule were previously established 
to investigate processes of negative selection (Clayton et al, 1997; Ghendler et al, 
1998; Sasada et al, 2000). Given that these mice express only a single TCR, the 

20 consequences of parenteral injection of the VSV8 peptide on thymocyte fate at various 
stages of development could be readily examined. These studies showed that DP 
thymocytes underwent apoptosis within 3 h of a single 24 |ig VSV8 administration 
(Clayton et al, 1997). Consequently, for cDNA cloning purposes, DP thymocytes were 
isolated by fluorescence activated cell sorting from the thymii of N15tg RAG-2 7 " TCR 

25 transgenic mice on an H-2 b or H-2 d MHC background. The Nl 5tg RAG-2" /_ H-2 b mice 
were injected 1 h previously with VSV8 peptide to induce negative selection. The 
resulting RNAs from DP thymocytes were used to perform RDA (Lisitsyn et al, 1993; 
Hubank and Schatz, 1994) with the VSV8-stimulated N15tg RAG-2' A H-2 b RNA as the 
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tester and the unstimulated N15tg RAG-2"'" H-2 d RNA as the driver. The mRNA pool 
from the N15tg RAG-2' A H-2 d mice is representative of DP thymocyte transcripts, but 
importantly, lacks those mRNAs induced via TCR activation since the N15 TCR cannot 
be triggered on the H-2 d background (Figure 1). The mRNA pool of the N15tg RAG-2* 
5 /_ H-2 b mice 1 h after VSV8 peptide injection contains the same DP thymocyte mRNAs 
and, in addition, mRNAs induced during TCR-triggered negative selection. Thus, RDA 
using these two DP thymocyte populations will preferentially amplify cDNA copies of 
mRNAs induced during the process of negative selection. The use of an early time 
point and sorted thymocytes obviates any RNA contribution from thymic stromal 

10 elements activated as a consequence of molecular crosstalk between thymocytes and 
nonlymphoid components as observed previously (Lerner et oi, 1996). 

PCR products were size fractionated and subcloned to create a subtracted library. 
Of these inserts, 80% hybridized to nur77 and were excluded from further analysis. 
Plasmid DNAs were prepared from randomly chosen colonies of the remaining 20% and 

1 5 inserts used to probe Northern blots of total thymus RNA from Nl 5tg RAG-2' A H-2 d 
and VSV8-injected N15tg RAG-2" 7 " H-2 b mice. One insert of three tested showed strong 
induction in the VSV8-injected N15tg RAG-2 7 " H-2 b RNA but did not hybridize to the 
N15tg RAG-2 7 " H-2 d RNA (see below). Re-analysis of the nur77-unrelated clones in the 
subtracted library showed that half hybridized with this novel cDNA. For further 

20 characterization, the 176 bp insert was used to screen a thymic cDNA library and the 
full length clone obtained. 

Example 2 

Comparison with known IkB proteins 

The predicted protein sequence (SEQ ID NO: 4) of the full-length cDNA 
25 product (SEQ ID NO: 3) is shown in Figure 2. This sequence is highly homologous to 
members of the IkB family of proteins as aligned in Figure 2. Therefore, this gene 
product has been termed IkBNS. The clone encodes a 327 amino acid protein 
containing 7 ankyrin domains (labeled A-G, Figure 2), 4 potential protein kinase C 
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phosphorylation sites ([S/T]-X-[R/K] at aa 20-22, 43-45, 220-222 and 289-291) and one 
potential casein kinase II phosphorylation site ([S/T]-X(2)-[D/E] at aa 214-217). IkBNS 
does not contain a PEST sequence as is found in IkBoc and involved in the control the 
basal turnover of IkBcc protein levels (Verma et al, 1995). The IkBNS protein 
5 sequence has highest similarity (43%) with "molecule possessing ankyrin-repeats 
induced by lipopolysaccharide" (MAIL or IkB£), an LPS-inducible IkB protein 
(Kitamura et al, 2000; Yamazaki et al, 2001). Other IkB family members have 29- 
39% similarity by BLAST search. The Figure 2 dendrogram shows that next to MAIL 
(IkB()> human Bcl-3 is most similar. Thus, a new IkB family member has been 
10 identified. 

A comparison of the ankyrin domains of six IkB family members is presented in 
Figure 2. Ankyrin repeats span over 33 residues and are highly divergent (Bork, 1993). 
However, a basic core motif consisting of two antiparallel a-helices connected by a 
tight-hairpin loop is a conserved key feature. Tandem arrays of ankyrin repeats give rise 

15 to right-handed superhelixes, where one helix of the core motif (inner helix) interacts 
with the next inner helix of the following repeat (Gorina and Pavletich, 1996; Huxford 
et al, 1998; Jacobs and Harrison, 1998; Mandiyae/ al, 1999; Zhang and Peng, 2000). 
When an alignment using the entire IkBNS protein sequence is performed, the first 6 of 
7 ankyrin domains in IkBNS align with ankyrin domains 1 through of the structurally 

20 characterized IkB protein, IkBoc (lnfi) (Jacobs and Harrison, 1998). Repeats B, C, E, F 
are the most conserved whereas the N-terminal (A) and C-terminal (G) repeats are much 
less conserved. Ankyrin repeats do not behave as independent folded units but associate 
in tandem to provide stabilizing interactions (McDonald and Peters, 1998; Zhang and 
Peng, 2000). In this context, the C-terminal and N-terminal ankyrin repeats interact 

25 only with one neighboring repeat, and therefore have less evolutionary constraints 

imposed upon them. Consequently, these terminal repeats are more prone to amino acid 
sequence substitutions. 

Ankyrin repeat D of IkBNS deserves special attention. IkBNS and MAIL 
(IkBQ have an insertion in ankyrin repeat D that lies within the tight P-turn of the 
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ankyrin repeats of plOO, pl05, Bcl-3 and IxBa (Figure 2). Secondary structure 
predictions indicate that IkBNS has an extended outer helix rather than a long loop 
insertion between the inner and outer helices of the ankyrin repeat. If this is the case, 
the packing of the ankyrin repeats would be interrupted at this point, and a second 
5 tandem of ankyrin repeats would then start at ankyrin repeat E and terminate with 
ankyrin repeat G. 

Control of IkBcx protein levels is mediated through phosphorylation- 
ubiquitination processes resulting in its degradation. The DSGL[D/G/E]S (SEQ ID 
NO:5) motif found in IkBcx, P and e is not present in IkBNS . Furthermore, there are no 
10 lysines equivalent to those in IkBcx which can serve as targets for ubiquitination (Karin 
and Ben-Neriah, 2000) implying that IkBNS is insensitive to this degradation pathway. 
In addition, control of the basal IkBcx protein level is through a PEST sequence located 
in the carboxy terminal portion (Karin and Ben-Neriah, 2000) and not found in IkBNS. 

Example 3 

1 5 Human homolog 

A blast search using the 176 bp insert of the original RDA clone revealed high 
similarity with a human genomic sequence in Genbank. This sequence consists of 
39,163 bp from human chromosome 19 and contains APLP1, a transmembrane 
glycoprotein related to Alzheimer disease associated amyloid beta protein precursor. 

20 The full length clone of IkBNS also matches this human chromosome 19 sequence in 
discontinuous stretches over a large area (from ~ bp 23,000 to bp 33,500 in the antisense 
direction relative to APLP1) suggesting the presence of a homologous human gene with 
7 or more exons, (SEQ ID NO: 1). Comparison of the murine IkBNS protein sequence 
(SEQ ID NO: 4) with that of the hypothetical (predicted) human protein (SEQ ID NO: 

25 2) shows 91% identity. Hence, a human IkBNS gene exists and likely serves a 
comparable function to the mouse counterpart. 
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Example 4 

Analysis of IkBNS mRNA expression 

Using the 176 bp RDA product as a probe, we examined the tissue distribution 
of this gene as shown in Figure 3A. While faint hybridization to spleen mRNA was 
5 noted, heart, brain, lung, liver, skeletal muscle, kidney and testis were negative. In 
addition, very low levels can be detected in RNA from C57BL/6 thymus (Figure 3B, 
right panel) perhaps as a consequence of ongoing negative selection. However, in 
N15tg RAG-2" 7 " H-2 d or N15tg RAG-2 7 " H-2 b animals, IkBNS message is not detected in 
the thymus of control animals but is apparent within 1 h of i.v. injection of VSV8 

1 0 peptide into Nl 5tg RAG-2' 7 ' H-2 b mice (Figure 3B, left panel). Similarly, IkBNS 
message is strongly induced within 1 h of pigeon cytochrome c (PCC) peptid (aa 88- 
104) injection into TCRCyt5CC7-I-RAG-2' 7 ' H-2 a mice which bear a TCR recognizing 
PCC bound to the I-Ek class II MHC molecule (Seder et al 9 1992) (Figure 3B, middle 
panel). Under these conditions PCC peptide administration induces negative selection 

15 of DP thymocytes in TCRCyt5CC7-I-RAG-2' 7 ' H-2 a mice. Note the absence of IkBNS 
message in the basal state. Collectively, the results demonstrate that induction of 
negative selection in both class I- and class II-restricted TCR transgenic animals results 
in increased levels of IkBNS mRNA. This message is also induced in thymic RNA 
isolated from C57BL/6 mice treated in vivo with anti-CD3e monoclonal antibody, a 

20 treatment known to induce death in DP thymocytes (Smith et ai, 1989). While 

apoptotic death of immature thymocytes is also induced by glucocorticoids (Wyllie, 
1980) and y-irradiation (Sellins and Cohen, 1987), thymic RNA from mice treated with 
0.5 mg dexamethasone or 500 rads show essentially no induction of IkBNS when 
mRNA levels are normalized with GAPDH message (Figure 3B). Thus IkBNS is 

25 induced only by TCR-triggered events in immature thymocytes 

Analyses of the N15tg RAG-2" 7 " H-2 b mice using variant peptide ligands have 
demonstrated that the VSV8 peptide p4 position is critical in determining thymic 
selection outcomes (Ghendler et aL, 1998; Sasada et aL, 2000). Hence, small changes 
in the peptide there can be sensed by the TCR. While VS V8 or the isoleucine variant 
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CM) induce negative selection in FTOC using N15tg RAG-2" 7 * H-2 b mice, leucine (L4) or 
norvaline (Nor4) substitutions result in positive selection (Clayton et aL, 1997; 
Ghendler et al s 1998; Sasada etaL, 2000). Using this information, induction of IkBNS 
in N15tg RAG-2" A H-2 b FTOC treated with these peptides was analyzed. SEV9, a Sendai 

5 virus-derived peptide that binds Kb with an affinity equal to that of VSV8 but is not 
recognized by the N15 TCR serves as a control peptide. As shown in Figure 3C, the 
negatively selecting peptides, VSV8 and 14, result in IkBNS mRNA induction within 2h 
of addition to N15tg RAG-2" 7 " H-2 b FTOC. In contrast, the L4 and Nor4 peptides that 
induce positive selection and the control non-selecting SEV9 peptide do not upregulate 

10 steady state expression of the IkBNS mRNA. Induction of IkBNS mRNA therefore 
correlates with the process of negative selection in thymocytes. 

Example 5 

NF-kB inhibition by IkBNS 

The major function of IkBoc is to bind NF-kB in the cytoplasm and prevent its 

1 5 nuclear translocation and subsequent DNA binding by blocking these sites on NF-kB. 
To determine if IkBNS has a similar inhibitory function, an NF-KB-sensitive luciferase 
reporter in Cos-7 cells cotransfected with IkBNS was assayed. NF-kB activity was 
induced by a 7 h treatment of transfected Cos-7 cells with 50 ng/ml phorbol-12- 
myristate- 13 -acetate (PMA) prior to analysis of luciferase activity. PMA treated Cos-7 

20 cells showed an increase of approximately 5-10 fold in NF-kB luciferase activity 

relative to untreated cells (Figure 4). Cotransfection with an IkBNS construct reduced 
NF-KB-driven luciferase activity to a level approaching that of cells transfected with the 
vector only. Cotransfection of IkBcc with the NF-kB luciferase construct inhibited 
PMA-induced NF-kB luciferase activity almost completely. Thus both IkBNS and 

25 IkBcx reduce NF-KB-sensitive reporter activity in Cos cells. IkBNS also reduced the 
activity of NF-KB-sensitive reporter constructs in transiently transfected Jurkat T cells. 

To further assay IkBNS function, the effects of a chimeric GST-IkBNS fusion 
protein on NF-kB electrophoretic mobility shift assays (EMS As) was tested. NF-kB gel 



1062.2002-004 

t 

» 



-52- 

shift analysis was performed on nuclear extracts from N15tg RAG-2" 7 ' H-2 b mice 1 h 
after VSV8 injection (Figure 5). Addition of GST-IkBNS and GST-IkBoc but not GST 
blocks the gel shift band in N15tg RAG-2"'- H-2 b thymic nuclear extracts. GST-IkBNS 
also blocks NF-kB binding in nuclear lysates of anti-CD3e mAb-treated C57BL/6 
5 animals. Addition of 100X cold NF-kB probe blocks formation of the NF-kB band but 
100X cold AP-1 probe has no effect, demonstrating the specificity of this gel shift. In 
contrast, the GST-fusion proteins have no affect on the AP-1 gel shift, but thelOOX cold 
AP-1 probe inhibits formation of this band. Collectively, these data suggest that IkBNS 
and IkBoc bind the NF-kB components and prevent subsequent DNA binding. 

10 Example 6 

Interaction of IkBNS with NF-kB family members 

To examine the specificity of IkBNS binding in vivo, thymic cytosolic and 
nuclear lysates were prepared from N15tg RAG-2" /_ H-2 b mice 10, 30 or 60 min after i.v. 
VSV8 injection or from uninjected animals and used for Western blot analysis of 

1 5 IkBNS- and iKBcc-interacting proteins. The total cellular levels of p50, p65 and RelB 
proteins are shown in Figure 6A. There is little change in their cytosolic level upon 
VSV8 injection; in contrast, p50, p65 and RelB proteins are barely detectable in the 
nuclear fraction in control animals. Within 10 min of TCR triggering, however, all are 
readily observed in the nucleus. Presumably triggering through the TCR has induced 

20 degradation of cytosolic IkBcx and allowed translocation of these proteins to the nucleus. 
Both c-Rel and p52 show similar patterns of expression to those in Figure 6A although 
p52 is detectable in the nuclear fraction of the 0 time point. 

GST, GST-IkBNS and GST-IkBcc were incubated with the thymic lysates and 
interacting proteins identified by Western blot analysis (Figure 6B). GST protein (Ctl) 

25 does not interact with any of the proteins analyzed. Aside from some basal p50-lKBNS 
interaction, the interaction of GST-IkBNS and GST-IkBoc with p50, p65 and RelB 
proteins is largely dependent on TCR triggering. The presence of endogenous cytosolic 
IkBoc at the 0 time point likely blocks interaction with the GST-Ik B a and GST-IkBNS 
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fusion proteins. In the cytosolic fraction, interaction with p50 and RelB are readily 
detected in the IkBNS pull down experiment at 10, 30 and 60 min. However, no p65 
association is detected at any time point tested even though p65 is present in this 
fraction (Figure 6A) and can interact with IkBcx (Figure 6B). The preference of IkBNS 
5 for p50 and IkBoc for p65 is born out in interaction analysis on in vitro translated 
proteins (Figure 6C). Moreover, GST-IkBoc binds all three proteins in the cytosol 
consistent with its ability to interact with various NF-kB heterodimers. The amount of 
cytosolic p50 bound to GST-IkBNS and GST-IkBoc is approximately equivalent while 
more cytosolic RelB is bound by GST-IkBcc than by GST-IkBNS. The pattern of 
10 interaction with cytosolic c-Rel is exactly like that of RelB for both GST-IkBNS and 
GST-IkBcc. 

Within the nuclear fraction, however, GST-IkBNS displays a surprisingly 
similar pattern of NF-kB binding to that of GST-IkBoc, interacting with p50, p65 and 
RelB although binding more p50. While the amounts of total p50, p65 and RelB 

1 5 proteins are approximately equal in both nuclear and cytoplasmic compartments (Figure 
6A) (aside from the 0 time point where no p65 or RelB is detected in the nuclear 
fraction of control mice), IkBNS binding is more apparent in the nuclear than cytosolic 
fractions. GST-IkBNS binds to p52 only in the nuclear fraction and the pattern of 
interaction with c-Rel is exactly like that of RelB. That the GST-IkBNS interaction 

20 differs between the nuclear and cytosolic fractions is noteworthy. In the nucleus NF-kB 
proteins may be bound to DNA and, hence, the conformation in the presence of DNA 
altered to allow interaction with IkBNS in the nuclear fraction. A second possibility is 
that there may be TCR-triggering-induced modification of NF-kB proteins that allows 
an interaction with GST-IkBNS in the nucleus. A third possibility might involve a 

25 distinct protein(s) within the nucleus that facilitates the interaction of IkBNS with p65 
and RelB (and c-Rel) in this compartment. 
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Example 7 

//cSA^S blockade inhibits antigen-induced negative selection 

To determine the functional consequence of inhibition of IkBNS expression 
during antigen-induced TCR triggering of DP thymocytes, antisense oligonucleotides to 
5 block IkBNS translation in FTOC of N15tg RAG-2'" H-2 b mice were used. A 21 bp 
phosphorothioate antisense oligonucleotide was designed to hybridize to the initiating 
ATG of IkBNS as well as 9 bases in the 5' and 3' directions (Hattori et al, 1996). An 
oligonucleotide consisting of the complimentary 21 bases in the sense direction was 
used as a control. On the fourth day of culture, sense and antisense oligonucleotides 

10 were added to FTOCs at 200 |ag/ml. After 12-19 h, VSV8 peptide was added to some 
FTOC and the culture continued for 4 h. As shown in Figure 7A, the control FTOC 
contains approximately 57% DP thymocytes. Four hours after VSV8 treatment, the 
percentage of DP thymocytes is reduced to 22%. The presence of the IkBNS antisense 
oligonucleotide (AS) prevents deletion of DP cells (52% DP), while the IkBNS sense 

15 oligonucleotide (S) has no effect (21% DP). Although not shown, IkBNS antisense or 
sense oligonucleotides in the absence of VSV8 had no effect on the percentage of DP 
thymocytes in FTOC. Changes in the absolute numbers of total thymocytes as well as 
DP and CD8 SP subpopulations from these FTOC are shown in Figure 7B. VSV8 
induces a 50% reduction in the total number of thymocytes; this loss is confined to the 

20 DP thymocyte compartment as the number of CD8 SP thymocytes remains unchanged. 
Strikingly, the presence of the antisense IkBNS oligonucleotide almost completely 
blocks the antigen-induced loss of thymocytes while the sense oligonucleotide has no 
effect. 
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